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RELATED APPLICATIONS 

This application is a continuation in part of Serial No. 09/419,568, filed October 18, 1999, 
which is a continuation in part of Serial No. 09/354,243, filed on July 16, 1999, which in turn is a 
continuation in part of Serial No. 09/178,973, filed October 26, 1998. All of these applications are 
incorporated by reference in their entirety. 

FIELD OF THE INVENTION 

This invention relates to newly isolated nucleic acid molecules and their uses. The nucleic 
acid molecules are shown to be upregulated by the cytokine interleukin-9 ("IL-9"). Also disclosed 
are the proteins encoded thereby. These molecules have been described as "T Cell Derived 
Inducible Factors" or "TIFs"; however, as of now, they are known as "interleukin-21" or "IL-21", 
as well as "IL-TIF." For this reason, the terms will be used interchangeable herein. These nucleic 
acid molecules encode proteins which induce STAT activation in cells. They can be used, for 
example, in the stimulation of regeneration of targeted tissues . Further, their inhibitors or 
antagonists can be used to retard, prevent or inhibit differentiation of other tissues. 

BACKGROUND AND PRIOR ART 

The last decade has seen knowledge of the immune system and its regulation expand 
tremendously. One area of particular interest has been that of research on the proteins and 
glycoproteins which regulate the immune system. One of the best known families of these 
molecules are the cytokines. These are molecules which are involved in the "communication" of 
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cells with each other. The individual members of the cytokine family have been found to be 
involved in a wide variety of pathological conditions, such as cancer and allergies. Whereas 
sometimes the cytokines are involved in the pathology of the condition, they are also known as being 
therapeutically useful. 

5 Interleukins are one type of cytokine. The literature on interleukins is vast. An exemplary, 

but by no means exhaustive listing of the patents in this area includes U.S. Patent No. 4,778,879 to 
Mertelsmann et al.; U.S. Patent No. 4,490,289 to Stern; U.S. Patent No. 4,518,584 to Mark et ah; 
and U.S. Patent No. 4,851,512 to Miyaji et al., all of which involve interleukin-2 or "IL-2." 

i; 3 Additional patents have issued which relate to interleukin-1 ("IL-1"), such as U.S. Patent No. 

19 4,808,61 1 to Cosman. The disclosure of all of these patents are incorporated by reference herein. 
P More recent patents on different interleukins include U.S. Patent Nos. 5,694,234 (IL-1 3); 5,650,492 

(IL-12); 5,700,664, 5,371,193 and 5,215,895 (IL-11); 5,728,377, 5,710,251, 5,328,989 (IL-10); 
J 3 5,580,753, 5,587,302, 5,157,112, 5,208,218 (IL-9); 5,194,375, 4,965,195 (IL-7); 5,723,120, 

ry 5,178,856 (IL-6), and 5,017,691 (IL-4). Even a cursory review of this patent literature shows the 

0 diversity of the properties of the members of the interleukin family. One can assume that the larger 

cytokine family shows even more diversity. See, e.g., Aggarwal et al., ed., Human Cytokines: 
Handbook For Basic And Clinical Research (Blackwell Scientific Publications, 1992), Paul, ed., 
Fundamental Immunology (Raven Press, 1 993), pg 763-836, "T-Cell Derived Cytokines And Their 
Receptors", and "Proinflammatory Cytokines and Immunity." All cited references are incorporated 

20 by reference. 
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The relationships between various cytokines are complex. As will be seen from the 
references cited herein, as the level of a particular cytokine increases or decreases, this can affect the 
levels of other molecules produced by a subject, either directly or indirectly. Among the affected 
molecules are other cytokines. 

The lymphokine IL-9 , previously referred to as "P40," is a T cell derived molecule which was 
originally identified as a factor which sustained permanent antigen independent growth of T4 cell 
lines. See, e.g., Uyttenhove et al, Proc. Natl. Acad. Sci. 85: 6934 (1988), and Van Snick et al, J. 
Exp. Med. 169 : 363 (1989), the disclosures of which are incorporated by reference, as is that of 
Simpson et al., Eur. J. Biochem. 183: 715 (1989). 

The activity of IL-9 was at first observed on restricted T4 cell lines, failing to show activity 
on CTLs or freshly isolated T cells. See, e.g., Uyttenhove et al ., supra , and Schmitt et al., Eur. J. 
Immunol. 19: 2 1 67 (1 989). This range of activity was expanded when experiments showed that IL-9 
and the molecule referred to as T cell growth Factor EI ("TCGF III") are identical to MEA (Mast 
Cell Growth Enhancing Activity), a factor which potentiates the proliferative response of bone 
marrow derived mast cells to IL-3, as is described by Hiiltner et al., Eur. J. Immunol, and in U.S. 
Patent No. 5,164,317, the disclosures of both being incorporated by reference herein. It was also 
found that the human form of IL-9 stimulates proliferation of megakaryoblastic leukemia. See Yang 
et al, Blood 74: 1880 (1989). Work on IL-9 has shown that it also supports erythroid colony 
formation (Donahue et al., Blood 75(12): 2271-2275 (6-15-90)); promotes the proliferation of 
myeloid erythroid burst formation (Williams et al., Blood 76: 306-3 1 1 (9-1-90); and supports clonal 
maturation of BFU-E r s of adult and fetal origin (Holbrook et al, Blood 77(10): 2129-2134 (5-15- 
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91)). Expression of IL-9 has also been implicated in Hodgkins's disease and large cell anaplastic 
lymphoma (Merz et al., Blood 78(8): 1311-1317 (9-1-90). Genetic analyses of mice that were 
susceptible or resistant to the development of bronchial hyperresponsiveness have unraveled a 
linkage with the EL-9 gene as well as a correlation between IL-9 production and susceptibility in this 
model micolaides et al.. Proc. Natl. Acad. Sci. USA, 94, 13175-13180, 1997). Human genetic 
studies also point to the IL-9 and IL-9R genes as candidates for asthma (Doull et al., Am. J . Respir. 
Crit. Care Med., 153, 1280-1284, 1996; Holrovd et al. . Genomics 52, 233-235, 1998). Also, IL-9 
transgenic mice allowed for the demonstration that increased EL-9 expression result in lung 
mastocytosis, hypereosinophilia, bronchial hyperresponsiveness and high levels of IgE ( Temannet 
al, J. Exp. Med. 188, 1307-1320, 1998; Godfraind et al. . J. Immunol. 160, 3989-3996, 1998; 
McLane et al. . Am. J. Resp. Cell. Mol. 19:713-720 (1999). Taken together, these observations 
strongly suggest that IL-9 plays a major role in this disease Additional work has implicated IL-9 and 
muteins of this cytokine in asthma and allergies. See, e.g. PCT Application US96/12757 (Levitt, 
et al), and PCT US97/21992 (Levitt, et al), both of which are incorporated by reference.. 

IL-9 is known to affect the levels of other molecules in subjects. See Louahed et al., J. 
Immunol. 154: 5061-5070 (1995; Demoulin et al. . Mol. Cell. Biol. 16: 4710-4716 (1996), both 
incorporated by reference. It will be recognized that the molecules affected have their own functions 
in biological systems. For example, Demoulin etal. show that many of the known activities of IL-9 
are mediated by activation of STAT transcription factors. As such, there is continued interest in 
trying to identify molecules whose presence and/or level is affected by other molecules, such as 
cytokines. 



6601591 



LUD 5664 JEL/NDH 



The disclosure which follows describes such molecules. It was found that nucleic acid 
molecules encoding the proteins of the invention were expressed in the presence of IL-9, but not in 
its absence. Hence, these molecules are, inter alia, "markers" for the expression or effect of IL-9 in 
a subject. The molecules once referred to as T Cell Derived Inducible Factors or "Tffs" have now 
been referred to as interleukin-2 1 , or "IL-2 1 ." These and other features of the invention will be seen 
in the disclosure which follows. 
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DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 

Example 1 

The murine lymphoma cell line BW5147 is well known as a cell line which can be grown 
in vitro, without the need to add any cytokines to its culture medium. In order to identify genes 
induced by IL-9, samples of BW5 147 were cultured either with (200 U/ml), or without IL-9, for 24 
hours. Then, total RNA was isolated, using guanidium isothiocyanate lysis, and CsCl gradient 
centrifugation. These techniques are well known in the art. Following this, polyadenylated RNA 
was purified from the total RNA, by using an oligo(dT) cellulose column. The isolated, polyA RNA 
was then used to generate double stranded cDNA. A commercially available oligo(dT) primer was 
used. Anywhere from 3-5 ug ofpolyA RNA were heated to 70 °C for 10 minutes with lugofoligo 
(dT), and then incubated with 5x first strand buffer (250 mM HC1 (pH 8.3), 375 mM KC1, 15 mM 
MgCl 2 ), 10 mM dithiothreitol, 500 uM of deoxynucleotide triphosphates, and 800 U of reverse 
transcriptase. Total volume of the reaction mixture was 20 ul, and the reaction was allowed to 
proceed at 37 °C for one hour. This resulted in synthesis of the first strand of cDNA. Second strand 
synthesis was accomplished by adding 30ul of 5 second strand buffer (1 OOmM Tris-HCl (pH 6.9)), 
450mM KC1, 23mM MgCl 2 , 0.75mM (3-NAD + , 50mM (NH 4 ) 2 S0 4 , together with 60U of E. coli 
derived DNA polymerase I, 2U of E. coli RNase H, 10 U of E. coli DNA ligase, and 250 uM of 
deoxynucleotide triphosphates, and brought to a final volume of 1 50 ul. The mixture was incubated 

for two hours, at 16°C. 

The product was extracted using phenol-chloroform, and was precipitated with ethanol. The 

final cDNA product was then resuspended in 200 ul of TE. 
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These steps were carried out for both the stimulated BW5147 cells ("tester" hereafter), and 
for parallel, unstimulated BW5147 cells ("driver" hereafter). 

Example 2 

The cDNA prepared in Example 1 was then subjected to subtraction cloning in accordance 
5 with well known methods. To do this, six oligonucleotides were prepared: 

5*-AGCACTCTCC AGCCTCTCAC CGCA-3 (SEQ ID NO: 1); 

5'-GATCTGCGGT GA-3' (SEQ ID NO: 2); 
M 5'-ACCGACGTCG ACTATCCATG AACA-3' (SEQ ID NO: 3); 

j$ 5-GATCTGTTCA TG-3' (SEQ ID NO: 4); 

m 5'-AGGCAACTGT GCTATCCGAG GGAA-3' (SEQ ID NO: 5); and 

H 5'-GATCTTCCCT CG-3' (SEQ ID NO: 6). 

L These were used as explained herein. Double stranded cDNA (2 ug), was digested with restriction 

ll endonuclease DpnII, extracted with phenol-chloroform, precipitated with ethanol, and resuspended 

□ in 20 ul of TE (lOmM Tris-HCl (pH 7.5); ImM EDTA). Twelve ul (1.2 ug), of cut cDNA was 

ligated to double stranded SEQ ID NOS: 1 and 2, in a mixture which included 4 ul of desalted SEQ 
ID NO: 1 (2mg/ml), 4 ul desalted SEQ ID NO: 2 (1 mg/ml), lOul of 5X adapter buffer (330mM 
Tris-HCl, pH 7.6, 50mM MgCl 2 , 5mM ATP), 7ul DTT (lOOmM), and 28ul of H 2 0). The 
20 oligonucleotides were annealed to each other and to the sample DNA by heating the mixture to 50° C 
and then cooling it to 10°C over one hour, followed by adding 5ul of T4 DNA ligase, and incubation 
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for 12-14 hours, at 12-16°C. The mixtures were diluted by adding 140 ul of TE. PCR was then 
carried out on 200 ul samples, as described infra . 

Example 3 

To carry out PCR, 200 ul samples containing 2 ul of the ligation product in a buffer of 66 
mM Tris-HCl, pH 8.8, 4 mM MgCl 2 , 16 mM (NH 4 ) 2 S0 4 , 33 ug/ml BSA, 0.3 mM of each dNTP 
(concentration: 500 uM), and 2 ug of SEQ ID NO: 1 were first heated at 72 °C for three minutes to 
remove any of SEQ ED NO: 2 which was hybridized to the product of Example 2. The 3' ends were 
then filled in by using 5U of Taq polymerase (5 minutes, 72°C). Twenty cycles of amplification 
were carried out (1 cycle:l minute at 95°C, and three minutes at 72°C), after which products were 
combined, phenol extracted, ethanol precipitated, and resuspended in TE buffer, at a concentration 
of 0.5 ug/ul. Hereinafter, this is referred to as the representation. 

Example 4 

The representation was then prepared for subtractive hybridization by removing SEQ ID NO: 
1 therefrom by digestion with Dpn H. The resulting digest was phenol extracted and ethanol 
precipitated. In the case of the unstimulated sample, this resulted in the driver, while the stimulated 
sample resulted in the tester. Portions of tester (20 ug) were gel purified on a 1 .2% agarose gel and 
isolated. Samples (2 ug), were ligated to SEQ ID NOS: 3 and 4, in the same way that SEQ ID NOS: 
1 and 2 were ligated, as described, supra . 
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In a first cycle of subtractive hybridization, 0.4 ug samples of tester with SEQ ID NOS: 3 
and 4 ligated thereto were mixed with 40 ug of driver cDNA. The mixture was phenol extracted, 
ethanol precipitated, dissolved in 2 ul of 3XEE buffer (30mM EPPS pH 8.0), 3mM EDTA; pH 8.0, 
3 mM EDTA. This was overlaid with 30 ul of mineral oil, and denatured for five minutes at 98 °C. 
5 A 5M NaCl solution (0.5 ul) was added, and DNA was hybridized for 20 hours, at 67 °C. The 

reaction mixture was diluted to 200 ul with TE, and tRNA carrier. The samples were incubated for 
three minutes at 72 °C to melt away SEQ ID NO: 4, and then four PCR reactions (200 ul) were 
prepared. These included 20 ul of diluted hybridization mix without primer, to fill in the ends of 
„ the reannealed tester, followed by 10 cycles of amplification after adding samples of SEQ ID NO: 
m 3 (1 cycle:lminute at 95 °C, three minutes at 70°C) after which products were combined, phenol 
B extracted, ethanol precipitated, and resuspended in 40 ul of 0.2XTE buffer. Single stranded DNA 
H was degraded by a 30 minute treatment of 20 ul of this material with 20U of mung bean nuclease, 
L at a total volume of 40 ul. Samples was diluted (1 :5), in 50 mM Tris-HCl, at pH 8.9, followed by 
nl five minutes of heating at 98 °C to inactivate the enzyme. A second PCR was carried out, using 20 

O ul of the product described supra . 2 ul of SEQ ID NO: 3 (1 mg/ml), and 1 ul (5 U) of Taq DNA 

polymerase. A total of 18 cycles (1 cycle:l minute at 95°C, three minutes at 70°C) were carried out. 
Products were combined, phenol extracted, ethanol precipitated, and resuspended at 0.5-1 ug/ul. 
The product is referred to hereafter as "DPI ", or the first difference product. 
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Example 5 

DPI was then digested with endonuclease DpnII, as described above, and was ligated to SEQ 
ID NOS: 5 and 6, following the same processes described for SEQ ID NOS: 1, 2, 3 and 4. 
Subtractive hybridization and selective amplification, as described in example 4, was repeated, and 
second difference product, or "DP2", was generated. In these experiments, 50 ng of DPI was the 
tester. The driver (40 ng), was as described supra . The process was repeated to generate a third 
difference product, using SEQ ID NOS: 3 and 4 as adapters. To generate the third product, 100 pg 
of tester were mixed with 40 ug of driver. All steps of the protocols supra were repeated, except 
the final amplification was carried out for 22 cycles, where one cycle was one minute at 95 °C, and 
three minutes at 70°C. This yielded the final difference product. 



Example 6 

The final difference products were digested with DpnII, and then cloned into the BamHI site 
of a commercially available vector, i.e., pTZl 9R. Double stranded DNA plasmids were prepared, 
and then sequenced, using standard methods. The sequences were compared to known sequences 
in the GenBank and EMBL data bases, using a BLAST search program. 

At the end of this subtraction procedure, a short cDNA fragment was identified, i.e., a 
fragment about 200 base pairs long. This fragment was used to screen a cDNA library from BW 
5147 cells. The largest clone was sequenced. It is discussed infra. It does not correspond to any 
known sequence. 



660159.1 



11 



LUD 5664 JEL/NDH 



The nucelotide sequence (SEQ ID NO: 7), is 1 1 1 9 bases long, including a 537 base pair open 
reading frame, which encodes a protein 179 amino acids long. The predicted molecular weight of 
the protein is 20,093. There are two additional ATG codons which, if they acted as start codons, 
would produce proteins 172 and 167 amino acids in length, with molecular weights of 19,335 and 
5 1 8,770 daltons, respectively. Each form of the protein is characterized by a sequence of hydrophobic 

amino acids which would be cleaved off of the molecule via the endoplasmic reticulum to provide 
a mature protein. 

Analysis of the sequence shows three AT rich motifs (TTATTTAT). These motifs are often 
found in 5'- untranslated regions of cytokines and oncogenes. Kruvs. et al. . Science 245: 852(1989), 
f I have shown that these repeats modulate stability of rnRNA. 

= !i Example 7 

* a The cDNA isolated and analyzed in example 6, supra , was then used as a probe to identify 

fii genomic DNA for TIFa. 

p A genomic library prepared from mouse strain 129 was screened with SEQ ID NO: 7, 

1 5 following standard methods. An EcoRI fragment from a positive clone was subcloned into plasmid 

pZERO and partially sequenced. The partial sequence is presented as SEQ ID NO: 8. 

Example 8 

A second EcoRI fragment from the positive clone described in Example 7, supra, was also 
subcloned. There was a great deal of homology, but the sequences were not identical. To be 
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specific, intron 1 of this sequence was 98% identical to SEQ ID NO: 8, intron 2 was 100% identical 
and intron 3 was 92% identical. 

What is striking about the sequences is that the promoters are not at all homologous, 
suggesting independent regulation. The 5' untranslated regions are 92% identical. The first exon 
for TIFa is split into exon la and exon 1 p. The first coding exon (which is exon lb for TIFa and 
exon 1 for TIFp) are 99.5% identical, while the second exons are 100% identical, the third exons 
97% identical, the fourth exons 98.5% identical, and 96% for the fifth exon. In the untranslated 3' - 
region, homology is 96%. 

Example 9 

Using the information described in example 8, supra , a cDNA sequence for the second clone, 
designated TIFp was deduced, and is set forth as SEQ ID NO: 9. The genomic DNA sequence was 
also ascertained, in the same manner as is described, supra, and is set forth as SEQ ID NO: 42. The 
amino acid sequence is SEQ ID NO: 41. 

As compared to the coding region for TIFa, that of TIFp has six silent changes. There are 
two changes which result in an inconsequential amino acid change (at both of positions 36 and 103, 
Val in TIFa becomes He in TIFp). There is also a more significant change, at position 1 12, where 
Gin becomes Arg. 

Example 10 
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Experiments were undertaken to study expression of the TTFs. BW 5147 cells were 
stimulated with recombinant murine IL-9 (200U/ml), for varying periods of time (0.2, 0.5, 1, 2 & 
24 hours). Total RNA was then isolated, using standard methods and reagents. Reverse 
transcription was then carried out, using 5ug total RNA and an oligo (dT) primer. Samples of 
5 cDNA corresponding to 20ng of total RNA were then amplified for 25 cycles using different 
primers. (One cycle was 4 minutes at 94 °C, 1 minute at 57°C, and 2 minutes at 72°C). The TIF 
primers were: 

5'-CTGCCTGCTT CTCATTGCCC T-3' (SEQ ID NO: 10) 
f =i and 

U 5-CAAGTCTACC TCTGGTCTCA T-3' (SEQ ID NO: 1 1) 

m (sense and antisense, respectively). 

H These correspond to nucleotides 107-127, and 766-786 of SEQ ID NO: 7, respectively. As 

L a control, (3-actin was amplified as well, for 1 8 cycles (first cycle: 4 minutes at 94°C, 1 minute at 

m 60°C, 2 minutes at 72°C. Succeeding cycles were 1 minute at 94°C, 1 minute at 60°C, 2 minutes at 

13 72°C). 

Following amplification, post PCR products were analyzed on a 1 % agarose gel, and specific 
amplification was confirmed, following blotting, using internal radioactive probes. The probe for 
TIF was: 

5'-GACGCAAGCA TTTCTCAGAG-3' (SEQ ID NO: 12) 
20 the conditions and probes set forth were not specific for one or the other of the forms of TIF; 

however, the amplification product of TIFa contains a Kpnl restriction site, while the restriction site 
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for TIFp does not. Digestion of the amplification products with Kpnl indicated that most, if not all, 
of the TIF mRNA induced by IL-9 was TIFa, suggesting that the TIFa expression was induced 
rapidly via the DL-9. The mRNA for TIFa was detectable after 30 minutes of stimulation, and 
reached a plateau over a 1-24 hour time period. 

Example 1 1 

Experiments were then carried out which showed that the induction of TIF mRNA by IL-9, 
described supra, does not require protein synthesis. In these experiments, total RNA was extracted 
from cells stimulated for 24 hours, as described in example 10, but with or without lOug/ml of a 
protein synthesis inhibitor, cycloheximide, for 4.5 hours. In a parallel set of experiments, cells were 
not stimulated. The total RNA was extracted, and RT-PCR amplification was carried out as 
described in example 10. Post-PCR products were analyzed on an ethidium bromide-stained, 1% 
agarose gel. What was seen was that the induction by IL-9 still occurred when protein synthesis was 
blocked. Hence, the effect of IL-9 is a direct effect, not requiring the synthesis of a protein mediator. 

Example 12 

In these experiments, the role of STAT proteins in induction of TIF mRNA was studied on 
derivatives of the cell line BW5147. The first line, BWh9R, expresses wild type human IL-9 
receptors. The line BW-Phel 16 is a transfectant with a single mutation (at position 1 16), which 
renders the receptor unable to activate STAT transcription factors. Still another cell line, BW-mut6, 
has a mutation which renders the receptor unable to activate STAT5, while retaining the ability to 
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activate STAT1 and STAT3. Finally, cell line BW-mut7 has a single mutation which renders the 

IL-9 receptor unable to activate STAT1 and STAT3,but which retains the ability to activate STAT5. 
Cell stimulation, isolation of total RNA, reverse transcription and amplification of cDNA 

were all carried out as described in example 10 (Cells were stimulated for 24 hours. Both human 
5 and murine recombinant IL-9 were used). The PCR products were analyzed on an ethidium bromide 

stained, 1% agarose gel, as describe supra . 

The analysis revealed that human IL-9 did not induce expression in BW-Phel 1 6, suggesting 

that STAT transcription factors are implicated. It was found that JL-9 induced TIF expression in the 
, = BW-mut6 mutant, but not the mut7 variant, suggesting that STAT 1 or ST AT3 are involved, but not 

|I STAT5. 

j & Example 13 

|; m The expression of TIF mRNA in normal mouse spleen cells was then studied. 

f jl Spleen cells from 10-12 week old Balb/c mice were cultured for 24 hours in control medium 

Q or the control medium supplemented with 20ug/ml of LPS (which activates B lymphocytes and 

15 macrophages), or ConA (which activates T cells), or ConA plus 1 % of a blocking antiserum against 

murine IL-9, with |3 actin being used as a control. Purification of RNA, RT-PCR analysis were 

carried out as described supra . 

The data indicated that TIF is, at best, very weakly expressed in resting spleen cells, not 

induced by LPS, but strongly induced by ConA. Anti IL-9 antiserum did not affect induction by 
20 ConA, suggesting that its effect is not mediated by IL-9, or is mediated by other cytokines. 
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When the ConA activated spleen cells were analyzed using sequences of RT-PCR products, 
it was found that these cells were expressing TIFa predominantly, or exclusively. 

Example 14 

Further experiments showed that TIF mRNA was expressed even in the absence of IL-9 
5 induction. 

Spleen cells from 5 week old FVB mice were enriched for T cells, using a nylon wool 
column. Then, the cells were stimulated for 24 hours in medium supplemented with ConA (a T cell 
activator), or PMA (which activates PKC in most cells), either with or without EL-9. 
j-S Total RNA was isolated using standard techniques, and then ten microgram samples were 

1:6 fractionated via electrophoresis on a 1 .3% agarose gel containing 2.2M formaldehyde. The fractions 

^ were then transferred to a nitrocellulose membrane, labeled, and assayed in a hybridization assay 
;U following Van Snick, et al, J. Exp. Med. 169: 363 (1989), incorporated by reference. 

r y The results indicated that the induction of TIF by ConA was not modified, and that IL-9 did 

O not induce TIF RNA in PMA activated spleen cells. 

15 Example 15 

The expression of TIF mRNA in various cell lines was tested. In these experiments, murine 
cell lines were stimulated for at least one day, with a particular cytokine. Specifically, 9T7 is a 
T cell lymphoma, which responds to IL-2, IL-4 or IL-9. Cell lines TS3 and TS6 are derived from 
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T helper cell clones, and proliferate in the presence of either IL-2 or IL-9. MC9 and LI38 are mast 
cell lines, which proliferate in the presence of either IL-3 or IL-9. 

Following stimulation, total RNA was prepared using standard guanidium isothiocyanate 
lyses, and CsCl gradient centrifugation. 
5 The 9T7 line was then analyzed by Northern blotting, as described in example 14, while the 

other lines were assayed using RT-PCR analysis, as described supra. 

It was found that IL-9 upregulated TIF expression in T helper cells and mast cells, while EL-2 
and IL-3 did not. The 9T7 cell line, however, showed roughly the same level of expression, 
f * regardless of the cytokine, indicating that IL-9 is not mandatory for TIF expression. 

JO Example 16 

K The expression of TIF mRNA in B cell lines was then studied. The cell lines A20, 70Z/3, 

U and BCL-1 are B cell leukemia cell lines which grow, in vitro , without cytokines. These cells were 

I Li stimulated for 24 hours with IL-4 and IL-9 and total RNA was isolated, using standard methods. 

□ Expression was analyzed by RT-PCR which was carried out for 35 cycles, followed by blotting and 

1 5 hybridization, as described supra . 

The results indicated that TIF expression is detectable in B cells, but is weakly upregulated 
at best in the presence of IL-9 and IL-4. 
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Example 17 

Experiments were then carried out to study expression of the inventive molecules in T helper 
cell lines. TS2 and TS1 are known T helper cell lines, derived from T helper cell clones, which 
proliferate in the presence of either IL-9 or IL-2 (TS2) 3 and either IL-9 or IL-4 (TS1). Specifically, 
TS1 or TS2 cells were grown in the presence of the listed cytokines for at least 10 days, after which 
RNA was extracted using known methods. Expression of the molecules was studied via RT-PCR 
(35 cycles), using the protocols described supra . In TS1 cells both IL-4 and IL-9 induce TIF 
expression, but IL-2 does not do so in TS2 cells. 

Example 18 

Expression of TIF mRNA in various mouse organs were studied. Total RNA was prepared 
from liver, kidney, heart, brain, intestine, spleen, thymus, lung, muscle and bone marrow, using 
standard guanidium isothiocyanate methodologies and CsCl gradient centrifugation. Forty cycles 
of RT-PCR were carried out, using the protocols described supra . Strongest expression was found 
in thymus tissue, while less intense signals were found in brain tissue, and weaker expression in the 
remaining tissues. 

Example 19 

The following experiments describe production of TIFa in 293-EBNA cells. 
Complementary DNA for TIFa was described supra . It was subcloned into a commercially 
available expression vector pCEP-4, in operable linkage with a CMV promoter. The resulting 
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plasmids were transfected into 293-EBNA cells, using standard lipofectamine methods. Following 
transfection, the cells were incubated in a methionine free medium, supplemented with 35 S labeled 
methionine, for 24 hours. Supernatant was harvested, and run on an acrylamide gel, followed by 
electrophoresis. The gel was then dried and exposed to autoradiography for 1 day. A control was 
5 then run by transfecting cells with the same plasmid, in which the cDNA was cloned in the antisense 

direction. 

A heterogenous band of about 25-30 kilodaltons was found from the cells transfected with 
TIF in the sense direction. Any discrepancies between the predicted molecular weight, the actual 
molecular weight in the system, and the heterogeneity, can be attributed to glycosylation. In a series 
jl of parallel experiments, cDNA encoding human TIF was expressed in the same way as the murine 
Cfi cDNA was expressed. With the exception of the change of the cDNA, all experimental parameters 

v & were the same. 

ni Example 20 

1 3 Further experiments were carried out to study production of TEFa in COS cells. Specifically, 

1 5 TIFa cDNA was subcloned into the plasmid pEF-BOS .puro described by Demoulin et aL supra , in 

operable linkage with the EF- 1 a promoter. The plasmid cDNA was transfected into COS cells, using 
the same lipofectamine method described supra . The cells were incubated in methionine free 
medium, supplemented with 35 S methionine for 24 hours, after which supernatant was treated as 
described in example 19, supra . Again, a heterogenous band of 25—30 kilodaltons was observed, 
20 as well as an 18 kilodaltonband, which probably represents a non-glycosylated form of the molecule. 
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Example 21 

In these experiments, it was discovered that TIF induces STAT activation in mesangial, 
neuronal melanoma, and hepatoma cells. It is known that when cytokines activate STAT factors, 
the factors dimerize, move from cytoplasm to the nucleus, and bind to target sequences in promoters. 
The details of the experiments follow. 

Transfected 293-EBNA cells as described supra were used following incubation in normal 
medium for 48 hours, as were supernatant from the controls, also described supra . Samples of a 
mouse kidney mesangial cell line, ("MES 13" hereafter), a rat pheochromocytoma cell line, ("PC 1 2" 
hereafter), four different human melanomas (SK23, AUMA, NA-8mel and MULL), human 
hepatoma (HepG3) and rat hepatoma (H-4-II-K) were used. Cell samples (0.5x 1 0 6 ) were stimulated 
for 5-10 minutes in the presence of 1% of supernatant. Nuclear extracts were then prepared, in 
accordance with Demoulin et al., Mol. Cell. Biol. 16: 4710 (1996), incorporated by reference. In 
brief, cells were washed with PBS and then resuspended in 1 ml of ice cold hypotonic buffer for 15 
minutes. (Buffer was lOmM HEPES buffer, pH 7.5, with lOmM KC1, ImM MgCl 2 , 5% glycerol, 
0.5 mM EDTA, O.lmM EGTA, 0.5mM dithiothreitol, and ImM Pefabloc, ImM Na3V 4 , and 5mM 
NaF). Cells were then lysed by adding 65 ul of NP-40, followed by vortexing. Nuclei were pelleted, 
by vortexing for 30 seconds at 14,000 rpm, followed by extraction in buffer supplemented with 
HEPES (20mM), glycerol (20%), and NaCl (420mM). Nuclear debris was removed by centrifuging 
for 2 minutes. DNA binding activity was determined in accordance with Demoulin et al, supra, 
using a 32 P labeled double stranded oligonucleotide called "GRR," which contains the STAT binding 
site of the FcyRI gene promoter, i.e.: 
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5'ATGTATTTCC CAGAAA-3* (SEQ ID NO: 13) 
and 

5*-CCTTTTCTGG GAAATAC-3' (SEQ ID NO: 14) 

corresponding to the upper and lower strands of the binding sites in the GRR probe. Briefly, 5ul 
5 volume of nuclear extracts were incubated in binding buffer (12mM HEPES, pH 7.6, lOmM KC1, 

0.5mM EDTA, 2.5% glycerol, O.lmg of poly(dl-dC) per ml) for 5 minutes. Radiolabeled GRR 

probe (10 5 cpm; approximately 0.5ng) was added, and incubation was continued for 25 minutes 

before loading onto a non-denaturing polyacrylamide gel. 
e = It was also noted that the complexes observed inMES 1 3 cells, described supra, were partially 

|| overshifted by both anti-STAT5 and anti-STAT3 antibodies, showing that (i) the cells under 
m examination were targets for TIF, and (ii) that STAT3 and STAT5 are major components of the 

!'* complex activated by TIF. The difference in STAT profile, as compared to the profile in Example 
L 12, supra , is attributable to the difference in cell source (human versus mouse). It was also observed 

¥ jj that human TIF works on murine cells, and vice versa. 

15 Example 22 

This example details the isolation and cloning of a nucleic acid molecule which encodes 
human TIF. First, human peripheral blood mononuclear cells were prepared via standard density 
gradient centrifugation. Following this preparation, samples were cultured for 24 hours, at 3 x 10 6 
cells/ml, either with or without anti-CD3 monoclonal antibody (The antibody was the commercially 
20 available OKT3 mAb, used in the form of ascites fluid at 1/500 dilution). This antibody was used 
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because T cell derived cytokines are generally expressed only upon activation by e.g., CD3 specific 
antibodies. 

Total RNA was isolated from these cells, using standard guanidine-isothiocyanate / CsCl 
ultra-centrifugation techniques. Following isolation, lO^g samples of the RNA were reverse 
transcribed using an oligo (dT)15 primer. 

Following preparation of cDNA, as outlined supra , samples which corresponded to lOOng 
of total RNA were amplified, via PCR, using the following primers: 
5* - AGGTGCTGAA CTTCACCCTG GA - 3' (SEQ ID NO: 15) 
5' - CCACTCTCTC CAAGCTTTTT CA - 3' (SEQ ID NO: 16) 

which are based upon a murine cDNA sequence, (i.e., SEQ ID NO: 7). The PCR conditions 
involved 30 cycles of amplification, with one cycle defined as 1 minute at 94 °C, followed by 1 
minute at 42 °C, and then 2 minutes at 72 °C. Amplification product was separated on an agarose 
gel, using standard methods, and then sequenced. The result indicated that fragments of the cDNA 
had been amplified. Hence, a second reaction was carried but, using the same materials except SEQ 
ID NO: 16 was replaced by SEQ ID NO: 17, i.e.: 
5' -CAAGTCTACC TCTGGTCTCA T- 3* 

This second PCR reaction was carried out for 25 cycles, with one cycle being defined as 1 minute 
at 94°C, followed by 1 minute at 45 °C, and then 2 minutes at 72°C. The amplification product was 
subjected to the same steps as the first one. A fragment of 41 8 nucleotides was amplified, which was 
found in anti-CD3 stimulated cells, but not resting PBMCs. This fragment's nucleotides sequences 
is set forth at SEQ. ID NO: 18. 
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Example 23 

Following preparation of amplification product, the 5' end of cDNA was isolated by using 
standard, 5' - RACE techniques. In brief, first strand cDNA was prepared by using SEQ ID NO: 1 9 
as a primer, i.e.: 

5' - TGGCCAGGAA GGGCACCACC T - 3' 

This primer was based upon the sequence information obtained in accordance with example 22. In 
brief, the 5* - RACE method was carried out by combining 1 /xg of total RNA, prepared as described 
supra . 2.5 pmoles of SEQ ID NO: 19, reverse transcriptase , reverse transcriptase buffer, 2.5 [A of 
dNTP mix (10 mM), 2.5 ^\ of MgCl 2 (25mM), and 2.5 /A of dithiothreitol (0.1 M). The reaction 
was carried out and, after completion, original RNA was removed via adding RnaseH, and Rnase 
TI. Any unincorporated dNTPs, as well as primer and proteins, were removed. The cDNA was 
tailed using terminal transferase, or "TdT." This enzyme creates a 3'-binding site for the abridged 
anchor primer, as described infra . Tailing was carried out by combining the purified, first strand 
cDNA, TdT, buffer (10 mM Tris-HCl, 25 mM KC1, 1.5 mM MgCl 2 ), and 200 /jM of dCTP. 

Following the tailing reaction, PCR was carried out using: 
5' -CCTATCAGAT TGAGGGAACA G - 3* (SEQ ID NO: 20) 
and 5' - RACE abridged anchor primer: 

5' - GGCCACGCGT CGACTAGTAC GGGnGGGIIGGGIIG - 3' (SEQ ID NO: 21). 
The amplification involved 35 cycles (1 cycle defined as 1 minute at 94°C, 1 minute at 56 °C, and 
2 minutes at 72 °C). Following this, nested amplification was performed on 5 {A of a 1/1 00 dilution 
of the amplification product, using SEQ ID NO: 20 and the abridged universal amplification primer: 



660159.1 



24 



LUD 5664 JEL/NDH 

5' - GGCCACGCGT CGACTAGTAC - 3' (SEQ ID NO: 22). 

Amplification involved 30 cycles (1 cycle being defined as 1 minute at 94°C, 1 minute at 56°C, and 
2 minutes at 72 °C). The resulting PCR product was cloned, following standard procedures, and 
sequenced. 

5 These three protocols, i.e., the two experiments described supra which generated fragments, 

and the 5' - RACE PCR, also described supra , permitted alignment of the sequenced amplification 
product, to generate the complete sequence. 

Following the alignment, oligonucleotides were generated which flanked the deduced open 

3*=% reading frame, i.e.: 

m 5' - CCTTCCCCAG TCACCAGTTG - 3' (SEQ ID NO: 23) 

• 1 and 

L 5' - TAATTGTTAT TCTTAGCAGG - 3' (SEQ ID NO: 24). 

r |j These primers were used to amplify the entire open reading frame, using mRNA from CD3 specific 

d mAb stimulated cells, as described supra . For amplification, 25 cycles (1 cycle being defined as 1 

15 minute at 94°C, 1 minute at 56°C, and 2 minutes at 72°C). 

The complete sequence of the human cDNA is set forth at SEQ ID NO: 25. It contains a 537 
base pair ORF which encodes a 179 amino acid protein. This is the same length as the murine 
protein. The human protein has 79% amino acid homology with the murine protein, and 25% 
homology with IL-10. Human and murine proteins are set out as SEQ ID NOS:43 and 40, 

20 respectively . 
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Example 24 

These experiments detail work on the isolation of human genomic DNA corresponding to 
the cDNA discussed supra . 

Based upon the cDNA sequences, primers were developed which correspond to nucleotides 
51-70 and the complement of nucleotides 631-650 of SEQ ID NO: 25. PCR was carried out, using 
standard methodologies. Specifically, lOOng of genomic DNA taken from CESS cells (an EBV- 
transformed, lymphoblastoid cell line) was used as a template, and 33 cycles of amplification were 
carried out (one cycle of amplification being defined as 94° C for 30 seconds, 50° C for 30 seconds, 
and 72° C for 5 minutes). Once a sequence was isolated, it was sequenced, and this is set forth as 
SEQ ID NO: 26. The sequence is about 4.8 kilobases in length, and is believed to contain the entire 
genomic sequence encoding the TIF molecule, lacking only the 5 ' flanking region, the promoter, and 
the 3 ! end. Analysis indicates that it contains 6 exons and 5 introns, as does the murine genomic 
sequence. 

Southern blot hybridization was carried out, using standard methods. It showed that the 
genome contains only a single copy of the TIF gene. 

Example 25 

It was of interest to identify where the genomic DNA discussed supra was located in the 
human genome. In order to do this, two different approaches were taken. In the first, the sequence 
discussed supra , i.e., SEQ ID NO: 26, was labeled with a flourescent label, and then was used to 
probe the human genome via fluorescent, in situ hybridization ("FISH") using standard methods. 
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In a second approach, a panel of radioactive hybrid clones were screened using the probe 
consisting of nucleotides 51-70 of SEQ ID NO: 25, and 5 '-ATC AGATGGA TT ACTGAATG-3 ' 
(SEQ ID NO:27). PCR was carried out using 25 ng of genomic DNA as a template, for 35 cycles, 
where one cycle is defined as 94°C for 1 minute, 55° C for 1 minute and 72 °C for 2 minutes. 

Both methodologies indicated that the gene is located at chromosome 12ql5. Some work 
links diseases associated with asthma at this site. See, e.g. Nat. Genet. 15:389-392 (1997); Ober, et 
al, Hum. Mol Genet. 7(9):1393-1398(1998); Nickel, et al, Genomic 46(1):159-162(1997); 
Takahashi, et al, Genomics 44(1):150-2(1997); Barnes, et al, Genomics 37(1):41-50(1996), all 
incorporated by reference. 

Public databases were consulted to determine if any part of the sequences were presented 
therein. The last exon of TIF was found on a BAC clone, derived from chromosome 12ql5. 
(Accession No: AC007458; 191,1 1 1 base pairs, BAC RDCI1 1-444B24). The identification of the 
last exon on this clone suggests that the TIF gene is located about 90 kilobases from the IFN gene, 
and less than 30 kilobases from a gene referred to as AK155, which is an IL-10 related cytokine 
(Knappe, et al, J. Virol 74:3881-3887 (2000)). 

Example 26 

These experiments describe the manufacture of antibodies which bind to the TIF protein. To 
make these, a peptide consisting of amino acids 40-61 encoded by SEQ ID NO: 7 was coupled to 
KLH carrier protein, using standard methods and a ratio of 1 mg peptide to 1 mg carrier protein. 
Subject animals (rabbits), were immunized 3 times, at 2 week intervals, with 1 50 ug of the complex. 
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The immunogen was emulsified in Complete Freund's Adjuvant for the first injection, and then 
Incomplete Freund's Adjuvant for the next two. 

A first bleed was performed one month after the last injection, and serum was prepared, 
following known methods. 

The serum was then tested in a standard Western Blot. In brief, 10 |il of supernatant from 
cells transfected with either SEQ ID NO: 7 or SEQ ID NO:25 were separated via SDS-PAGE 
electrophoresis, and then blotted onto PVDF membranes. Antiserum was diluted to 1 : 500, and used 
in a standard Western Blot protocol, together with anti-rabbit antibody as the secondary antibody, 
and a commercially available detection kit. 

It was found that the serum did, in fact, recognize the TIF protein. 

Example 27 

This example describes experiments carried out to identify cell lines which are responsive 
to human TIF. HEK293-EBNA human embryonic kidney cells, described supra , were seeded in 6 
well plates, at 3 x 10 5 cells/well one day before transfection. They were transfected with 2 ^g of 
pCEP-4 plasmid which contained cDNA for human TIF, under control of the CMV promoter. Cells 
were incubated after transfection, in 1.5 ml normal medium, for 3 days, in order to maximize 
production of recombinant human TIF. 

It was hypothesized that human TIF would induce activation of STAT transcription factors 
in the same way the murine factor did. The protocol of example 2 1 , supra was followed. Supershifts 
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were performed by adding anti-STAT antibodies (0.75/^g anti STAT 1, 1 /ug of anti-STAT 3, or 1 
jug of anti-STAT 5b), to mixtures of nuclear extracts and labeled DNA probe, and incubated. 

A TIF induced bandshift was observed when the hepatoma cell line HepG2 was used. The 
antibodies described supra were used to characterize the response further. Anti-ST AT-3 antibodies 
5 shifted most of the retardation complexes, while the weak, remaining complexes were supershifted 

by anti-STAT-1 antibodies. The anti-STAT-5 antibodies had no effect. This indicates that STAT-3 
and, to a lesser extent, STAT-1 are the major transcription factors activated by TIF. These results 
were also obtained using human hepatoma cell line HepG3, and also hepatoma cell line H411E. 

m EXAMPLE 28 

IQ This example describes experiments designed to measure STAT activation by means of a 

H reporter gene. The reporter was "pGRR5." This contains 5 copies of the sequence set out in 

„ example 2 1 , upstream of a luciferase gene under control of the TK promoter. The control was vector 

eft pRL-TK, which contains the remilla luciferase gene, under control of the TK promoter. 
O The assay was carried out by combining 10 6 HepG2 cells with 15 ^g pGRR5, and 1 \i% of 

15 Prl-TK (250V, 74Q, 1,200 /zF). The pool of transfectants was divided into 24-well plates (42,000 
cells/well). 

After 1 hour, cells were stimulated with either human TIF (1% HEK293 cell supernatant) 
with 300U/ml of human IL-6, 1% supernatant from mock transfected HEK293 cells, or medium 
alone. 
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After two hours, cells were pelleted, and lysed. Luciferase activity was monitored using 
standard methodologies. The results indicated that stimulation with the molecule of the invention 
increases transcriptional activity of a promoter that includes STAT-binding sites. 



EXAMPLE 29 

5 Activation of STAT-3 by cytokines like EL-6 is known to result in acute phase protein 

induction in hepatoma cells. To determine if TIF exerted the same activity, 5x1 0 6 HepG2 cells were 

stimulated for 2, 13, or 24 hours, with 1% supernatant from transiently infected HEK293-EBNA 
lh cells. In some experiments protein synthesis inhibitor cycloheximide was used at 10 jig/ml, and 
H combined with the cells and supernatants. Following stimulation, total UNA was isolated using 
|;Q standard methodologies, and reverse transcription was performed on 10 |ig samples of total RNA, 

rfi using an oligo(dT) primer. Then, cDNA corresponding to 20 ng of RNA was amplified, for 18 
O cycles, with primers specific for human serum amyloid A ("SAA"), i.e.: 

agctcagcta cagcacagat 

(sense, SEQ ID NO: 28) 
1 5 cctgccccat ttattggcag 

(antisense, SEQ ID NO: 29) 

Human al antichymotrypsin: 

tgtcctctgc caccctaaca 
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(sense, SEQ ID NO: 30) 
taattcacca ggaccatcat 
(antisense, SEQ ID NO: 31) 
for human haptoglobin: 
gtggactcag gcaatgatgt 
(sense, SEQ ID NO: 32) 
acatagagtgt taaagtggg 
(antisense, SEQ ID NO: 33) 
and for human P-actin: 
gctggaaggt ggacagcgag 
(sense, SEQ ID NO: 34) 
tggcatcgtg atggactccg 
(antisense, SEQ ID NO: 35). 

For SAA, Tm was 54°C, while it was 52°C for human al - antichymotrypsin and haptoglobin, and 
56°C for p-actin. PCR products were analyzed, in ethidium bromide stained agarose gels, using 
standard methods. 

The results indicated that TIF strongly induced SAA and al-chymotrypsin and, to a lesser 
extent, haptoglobin. In order to determine if TIF directly upregulates SAA, or whether protein 
synthesis is required, HEPG2 cells were stimulated, as described, in the presence of cycloheximide. 
SAA expression was not affected, indicating that protein synthesis was not required for TIF activity. 
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EXAMPLE 30 

The results, supra , show that TIF and IL-6 appear to have similar activities on acute phase 
reactants. As IL-6 activity is mediated through gp 130, experiments were carried out to determine 
if TIF activity is mediated through gp 130 as well. 
5 To determine if this was the case, HepG2 cells were transfected with a luciferase reporter, 

as described, supra , and were then stimulated with either TIF or IL-6, in the presence of polyclonal, 
anti-gp 130 antibodies which had been determined previously to block the activity of gp 130 
interacting cytokines. 
^ The results indicated that the polyclonals blocked only IL-6 activity, 

if Parallel experiments were carried out to determine if the IL-10R P chain was involved. In 

v. : - 

fS the presence of anti - IL-10R p antibodies, TIF activity was blocked completely while IL-6 activity 
M was unaffected. The same affect was observed upon assaying for SAA expression. 

r J] EXAMPLE 31 

O These experiments were designed to determine the ability of TIF to regulate acute phase 

15 proteins in vivo . 

Varying amounts of recombinant murine TIF (50, 12.5, 3.2, 0.8 or 0.2 \ig) were injected, 
intraperitoneally, into endotoxin resistant C3H/HeJ female mice (10-12 weeks old). Six hours after 
injection, the mice were killed, with the exception of the mice who received 50 jig of TIF, which 
were killed after 1 , 3, 6, 12, or 24 hours. Livers were removed and directly frozen in liquid nitrogen. 

20 Total RNA was then extracted, using standard methodologies, after which 10 ^g samples of total 
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RNA were fractionated on a 1 .3% agarose gel which contained 2.2 ml/liter formaldehyde. Samples 

were then transferred to nitrocellulose membranes. 

Murine S AA probes were manufactured, using a commercial labelling kit. The hybridization 

assay was carried out in accordance with Van Snick, et al, J. Exp. Med 169:363-368 (1989), 
5 incorporated by reference. The probe itself was obtained via PCR, as described supra , using cDNA 

from murine liver, isolated as described supra . The primers used to manufacture the probe were: 

tctgctccct gctcctggga 

(sense, SEQIDNO: 36) 
-~ and 

\4 s tccaggaggt ctgtagtaat 

m (antisense, SEQ ID NO: 37) 

H The results indicated that the highest dose of the murine TIF (50 jig), induced SAA expression in 
^ as little as 1 hour post i.p. injection. Maximal effect was reached after 6 hours. The expression level 
r l1 for SAA decreased at 24 hours past injection. 

15 The data generated from the experiments using varying doses of TIF indicated that maximal 

induction of SAA still resulted when 3.2 \ig dosages were used. SAA message was still detectable 
when as little as 0.8 j^g were used. 

EXAMPLE 32 

Initially, TIF was identified as a T cell derived cytokine. Most cytokines that regulate the 
20 liver acute phase are produced mainly during inflammation. In order to determine if TIF could be 
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produced via inflammatory stimulation, in vivo, 2 ng of E.coli lipopolysaccharide antigen were 
inj ected, intraperitoneally into 1 2 week old, female B ALB/c mice. Mice were killed two hours later, 
and organs were frozen in liquid nitrogen. Total RNA was isolated following standard protocols, 
and reverse transcription was performed on 10 jig total RNA, using oligo (dT) primers. Following 
the reverse transcription, cDNA corresponding to 20 ng of total RNA was amplified for 25 cycles, 
using primers specific for TIF, i.e.: 
ctgcctgctt ctcattgccc t 
(sense, SEQ ID NO: 38) 
and 

caagtctacc tctggtctca t 
(antisense, SEQ ID NO: 39), 

which had a Tm of 55°C. PCR products were analyzed via agarose gel electrophoresis. 

The results indicated that LPS induced expression of TIF in all organs examined, indicating 
that TIF is involved in inflammatory processes. 

The foregoing examples describe the invention, one aspect of which are isolated nucleic acid 
molecules, which encode TIF proteins such as those with the amino acid sequence of the protein 
encoded by the nucleotide sequence of SEQ ID NO: 7, 25 or 26. It will be appreciated by one of 
ordinary skill that the degeneracy of the genetic code facilitates the preparation of nucleic acid 
molecules which may not be identical to the nucleotide sequence of SEQ ID NO: 7, 25 or 26, but 
which encode the same protein. Of course, SEQ ID NOS: 7, 25 and 26 are preferred embodiments 
of this invention, but other embodiments are also a part of the invention. Genomic DNA, 



660159.1 



LDD 5664 JEL/NDH 



complementary DNA, and RNA, such as messenger RNA, are all to be included therein. Isolated 
nucleic acid molecules from other animal species, including other mammals, are also a part of the 
invention. A preferred aspect of the invention are isolated nucleic acid molecules whose 
complements hybridize to SEQ ID NO: 7, SEQ ED NO: 8, SEQ ED NO: 9, or SEQ ED NO: 25 or SEQ 
ED NO: 26 under stringent conditions. "Stringent conditions," as used herein, refer, for example, to 
hybridization at 65°C in buffer (3 .5xSSC), 0.02% Ficoll, 0.02% polyvinylpyrrolidone, 0.02% bovine 
serum albumin, 25mM NaH 2 P0 4 (pH 7), 0.1% SDS, 2mM EDTA, followed by a final wash at 
2xSSC, room temperature and then 0.1xSSC/0.2xSDS at temperatures as high as, e.g., about 65°C. 
More stringent conditions, such as O.lxSSC, can also be used. These nucleic acid molecules encode 
proteins of about 17-22 kD as determined by SDS - PAGE, which activates STAT proteins, such 
as STAT 1, STAT3 and/or STAT5. In glycosylated form, these proteins can range from about 17 
to about 30 kilodaltons, as determined by SDS - PAGE. Also a port of the invention are isolated 
nucleic acid molecules which encode proteins having at least 30%, preferably at least 45%, more 
preferably at least 60%, and most preferably 90% amino acid identity with an amino acid sequence 
of a protein encoded by SEQ ID No: 7, 25 or 26, e.g., SEQ ED NO: 42. 

Also apart of the invention are expression vectors which include the nucleic acid molecules 
of the invention, operably linked to a promoter, so as to facilitate expression of the DNA. It is well 
within the skill of the artisan to prepare such vectors. 

The vectors, as well as the nucleic acid molecules per se, can be used to prepare recombinant 
cells, be these eukaryotic or prokaryotic, wherein either an expression vector or the nucleic acid 
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molecule itself is incorporated therein. E. coli cells, COS cells, CHO cells, etc., are all examples of 
types of cells which may be used in accordance with this aspect of the invention. 

Proteins encoded by the above referenced nucleic acid molecules, preferably in isolated form, 
are another feature of this invention. By "protein" is meant both the immediate product of expression 
of the nucleic acid molecules, glycosylated forms of it, as well as multimeric forms, such as dimers, 
trimers, and so forth. Also a part of the invention are multimers, such as dimers, which contain at 
least one protein molecule of the invention, and at least one, different protein molecule. Preferably, 
this different protein molecule is a cytokine, such as IL-10. Also included as a feature of the 
inventions are constructs, such as fusion proteins, where all or apart of the proteins described supra 
are linked in some fashion, such as in a fusion protein, to at least one additional protein or peptide, 
or amino acid sequence. The "fusion partner" may be, for example, a molecule which provides a 
recognizable signal, either directly or indirectly, such as a FLAGpeptide, P-galactosidase, luciferase, 
and so forth. These fusion partners are preferably joined to the molecule which is described supra 
at the N- and/or C- terminus of the protein; however, it is to be understood that there are many 
techniques known for joining molecules to amino acids, and any and all of these methodologies can 
produce constructs which are a part of the invention. 

The individual protein molecules of the invention, as noted supra , will preferably have a 
molecular weight of from about 17 to about 30 kilodaltons, as determined by SDS-PAGE. In 
multimeric forms, the molecular weight of the complex will, of course, vary, but the TIF molecules 
contained therein will each have a molecular weight of about 1 7 to 30 kilodaltons, as determined by 
SDS-PAGE. 



660159 1 



36 



LUD 5664 JEL/NDH 



The proteins preferably consist of at least about 1 20 and no more than about 200 amino acids. 
Preferably, the amino acids sequences consists of or comprises all or part of the amino acid 
sequences encoded by SEQ ID NOS: 7, 8, 9, 25 or 26. More preferably, the amino acid sequence 
contains all but about the first 40 amino acids encoded by said SEQ ED's. Even more preferably, it 
contains all but about the first 20 amino acids encoded by these sequences. Most preferably, the 
protein comprises amino acids set forth at SEQ ID NO: 40 or 41 , as well as proteins defined by the 
amino acid identities set out supra . 

It will be appreciated by the skilled artisan that the proteins encoded by the above recited 
nucleic acid molecules are a feature of the invention, and may be used to produce antibodies, in 
accordance with standard protocols. Such antibodies, in monoclonal and polyclonal form, constitute 
a further feature of the invention as do fragments of said antibodies, chimeric forms, humanized 
forms, recombinant forms, and so forth. Also a feature of the invention are immunogens, comprising 
all or a part of the amino acid sequence protein molecules of the invention, preferably combined with 
an adjuvant, such as Complete or Incomplete Freund's Adjuvant. Portions of the protein sequences 
may be linked to other molecules, such as keyhole limpet hemocyanin, to render them more 
immunogenic. These antibodies can be used, e.g., to determine if the proteins of the invention are 
present. This is a further feature of the invention, as is now explained. It has been shown, in the 
examples, that the nucleic acid molecules of the invention were expressed in the presence of the IL- 
9. Hence, a further feature of the invention is a method to determine if IL-9 is or has been present, 
wherein one detects either the proteins of the invention, using antibodies for example, or mRNA 
using the nucleic acid molecules of the invention, as probes. The mRNA can be determined directly, 
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or in the form of cDNA. Such probes may or may not be labeled, as a matter of choice for the user. 
Hence, one can determine, for example, if, following administration of IL-9, the cytokine is still 
efficacious, by determining if the nucleic acid molecule of the invention is present. This type of 
assay can be adapted, for quantitative studies, wherein one determines, for example, either if a cell 
is sensitive to IL-9, and if so, how sensitive it is. One can also use the proteins of the invention to 
phosphorylate STAT proteins such as ST ATI, STAT3 and/or STAT 5. This in turn results in 
dimerization of the STAT protein, followed by migration to the nucleus to provoke the effect that 
these STAT proteins have on cells. 

One could also use these molecules to test the efficacy of EL-9 agonists or antagonists when 
administered to a subject, such as a subject suffering from lymphoma, an immune system disorder 
such as an allergy, acquired immune deficiency syndrome, autoimmune diabetes, thyroiditis, or any 
of the other conditions described in, e.g, U.S. Patent No. 5,830,454; 5,824,551, and pending 
application Serial No. 08/925,348, filed on September 8, 1997 now allowed, all of which are 
incorporated by reference. The molecules can also be used to mediate the role of IL-9 in these and 
other conditions. To elaborate, since IL-9 induces HFs, the TIFs are useful as IL-9 activity 
mediators. Thus, a further aspect of the invention is a method to determine activity of endogenous 
IL-9, such as in situations where excess IL-9 activity is implicated, such as asthmas, allergies, and 
lymphomas. One can also block or inhibit IL-9 activity by blocking or inhibiting TIF or TIF activity, 
using, e.g., antisense molecules, antibodies which bind to TIF, or other antagonists of these 
molecules. For example, muteins of TIF, which bind to the TIF receptor but do not activate it, 
thereby inhibiting IL-9 induced activity, are a feature of the invention. Examples of conditions 
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which can be treated by the use of such TIF muteins are allergies, asthma, and so forth. Muteins in 
accordance with the invention can be made in accordance with, e.g., Weigel, et al, Eur. J . Biochem 
1 80(2):295-300(l 989) and Epps, et al, Cytokine 9(3): 149-1 56(1 997), both of which are incorporated 
by reference. Such muteins can be used in the treatment of asthma, allergies, or both. Further, it will 
be clear to the skilled artisan that the models set forth, supra , can also be used to screen for 
appropriate muteins. The ability to regulate IL-9 activity is important in conditions such as those 
listed supra , as well as conditions such as apoptosis, including Cortisol induced apoptosis, conditions 
involving the nuclear expression of BCL-3, since EL-9 is known to induce such expression, and so 
forth. "Antibodies," as used herein, refers to any portion of an antibody which binds to TIF, 
including chimeric and humanized antibodies. 

Another feature of the invention relates to the ability of the TIF type molecules of the 
invention to either promote regeneration or inhibit differentiation of tissue types on which the 
molecules are active. As was shown, supra , the TIF molecules target various cancer and normal cell 
lines (i.e., mesangial and neuronal cells, as well as melanoma and hepatoma cells). Hence, one can 
stimulate regeneration of tissue via, e.g., adding an amount of a TIF type molecule to a sample in 
need of regeneration of a tissue acted on by the TIF molecule. This approach can be used both in 
vitro , and in vivo . Similarly, antagonists of TIF may be added when the situation is one where the 
aim is to inhibit differentiation of a particular type of tissue, such as melanoma or hepatoma. 

The genes which encode TIF, as noted in Example 25, supra , are located on chromosome 12. 
This chromosome is associated with asthma, as is known in the art. It is also known that region 
12ql 5 is associated with other inflammatory disease. Hence, a further embodiment of the invention 
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is a method for determining susceptibility to conditions such as, or related to asthma, by determining 
if aberrations, such as polymorphisms, deletions, additions, etc., are present at the site of the TIF 
gene. Such aberrations may be an indicia of susceptibility to, or of the presence of, asthma, an 
allergic condition, or one or more related conditions. The ability to detect aberrations in a DNA 
sequence is well known in the art, and such methods need not be set forth herein. Preferably, the 
aberration or aberrations is detected via standard techniques, such as PCR, using the methodologies 
and primers referred to supra . 

The data set forth supra , in particular the induction of expression of the molecule of the 
invention by LPS, and the regulation of the acute phase response by this molecule, indicate that the 
molecules of the invention are actively involved in the inflammatory response. Hence a further 
feature of this invention relates to identification of proinflammatory and anti-inflammatory agents 
via use of IL-TIF/IL-21. It will be clear to the skilled artisan that IL-TIF/ IL-21 can regulate the 
inflammatory response. A preferred aspect of this regulation is the modulation of the acute phase 
response by organs, such as the liver, by administering either IL-TIF/IL-2 1 , or an antagonist thereof. 
See, e.g. , Janeway, et al. , Immunobiology (4 th edition), incorporated by reference. Janeway explains 
that various cytokines such as IL-1, IL-6 and TNF-a activate hepatocytes to synthesize acute phase 
proteins, such as c-reactive protein, and mannan binding lectin, as well as those described in the 
examples, supra . 

The role of IL-TIF/IL-21 in activating acute phase proteins also leads to a method for 
identifying agonists and antagonists of IL-TIF/IL-21, by determining if acute phase protein 
production changes in the presence of the putative agonist or antagonist, and IL-TIF/IL-21. An 
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increase in production should be taken as an indicia that the test molecule is an agonist, and vice 
versa. 

Also a part of the invention are methods for regulating activity of EL-TIF/IL-21 in view of 
its relationships to interleukin-10 receptors. As was shown, supra , the use of IL-10Rp antagonists, 
such as antibodies, inhibit IL-TIF/IL-21 activity. Hence, another feature of the invention is the use 
of agonists and antagonists of IL- 10 receptors, such as IL-10Rp agonists, and antagonists, to regulate 
EL-TIF/IL-21 production. 

Other features of the invention will be clear to the artisan and need not be discussed further. 

The terms and expressions which have been employed are used as terms of description and 
not of limitation, and there is no intention in the use of such terms and expressions of excluding any 
equivalents of the features shown and described or portions thereof, it being recognized that various 
modifications are possible within the scope of the invention. 
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WE CLAIM 

1 . A method for stimulating expression of a STAT transcription factor, comprising contacting 
a cell capable of said expression with an amount of an IL-TIF/IL-21 to said cell sufficient to 
stimulate said expression. 

2. The method of claim 1, wherein said STAT transcription factor is STAT 3 or STAT 1 . 

3. The method of claim 2, wherein said STAT transcription factor is STAT 3. 

4. The method of claim 1 , wherein said IL-TIF/IL-2 1 is a mammalian EL-TIF/IL-2 1 . 

5. The method of claim 4, wherein said mammalian IL-TIF/IL-21, IL-TIF/IL-21 is human IL- 
TIF/IL21. 

6. The method of claim 5, wherein said human IL-TIF/IL-21 is encoded by SEQ ID NO: 25 or 
SEQ ID NO: 26. 

7. A method for inducing production of an acute phase protein in a cell, comprising contacting 
said cell with an amount of an IL-TIF/IL-21 sufficient to induce production of said acute 
phase protein. 
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8. The method of claim 7, wherein said cell is a liver cell. 

9. The method of claim 7, wherein said acute phase protein is human serum amyloid A, al 
chymotrypsin, or haptoglobin. 

10. The method of claim 7, wherein said IL-TIF/IL-2 1 is a mammalian IL-TEF/IL-2 1 . 

5 11. The method of claim 1 0, wherein said mammalian IL-TIF/IL-2 1 is human IL-TIF/IL-2 1 . 

^ 12. The method of claim 1 1, wherein said human IL-TIF/IL-21 is encoded by SEQ ID NO: 25 

a: : s 

m or SEQ ID NO: 26. 

m 13. A method for modulating activity of an IL-TIF/IL-2 1 molecule, comprising contacting a cell 
I * susceptible to IL-TIF/IL-2 1 activity with an IL-TEF-IL-2 1 modulator, in an amount sufficient 

dj to modulate IL-TIF/TL-21 activity. 

14. The method of claim 13, wherein said modulator is a substance which binds to IL-lORp 
molecules. 

1 5 . The method of claim 1 4, wherein said modulator is an antibody which binds specifically to 
an IL-lORp molecule. 
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16. The method of claim 16, wherein said modulator is an antagonist of an EL-10R molecule. 

17. The method of claim 16, wherein said IL-1 OR molecule is an agonist of an IL-1 OR molecule 

18. The method of claim 14, wherein said molecule is an agonist of an IL-10R molecule. 

19. The method of claim 18, wherein said IL-10R molecule is DL-10Rp. 

20. A method for determining exposure to an inflammatory substance, comprising assaying a 
sample taken from a subject believed to have been exposed thereto for expression of IL- 
TIF/IL-2 1 wherein expression of TIF is indicative of exposure to an inflammatory substance. 

21. A method for identifying a modulator of IL-TIF/IL-21, comprising contacting a substance 
believed to be a modulator of IL-TIF/IL-21 to a source of IL-TIF/IL-21 and a cell which 
expresses an acute phase protein, and determining acute phase protein produced by said cell, 
wherein a change in production of said acute phase protein relative to production by said cell 
in the absence of said substance is indicative of said substance being an IL-TIF/IL-21 
modulator. 

ABSTRACT OF THE DISCLOSURE 
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The invention involves isolation of nucleic acid molecules, the expression of which are 
upregulated by interleukin-9. The amino acid sequences of the proteins which correspond to the 
nucleic acid molecules show some structural features of cytokines. In addition to the nucleic acid 
molecules and the proteins, various uses of the molecules are disclosed. The molecules are referred 
to as T cell inducible factors. The molecules are implicated in activation of STAT molecules, acute 
phase proteins, and inflammation. 
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<120> Isolated Nucleic Acid Molecules which Encode T Cell Inducible Factors, or 
Interleukin-21, The Proteins Encoded, and Uses Thereof 

<130> LUD 5664 

<140> 
<141> 

<150> US09/419,568 
<151> 1999-10-18 

<150> US09/354,243 
<151> 1999-07-16 

<150> US09/178,973 

<151> 1998-10-26 

<160> 43 

<210> 1 

<211> 24 

<212> DNA 

<213> Mus musculus 

<220> 

<400> 1 

agcactctcc agcctctcac cgca 24 

<210> 2 

<211> 12 

<212> DNA 

<213> Mus musculus 

<220> 

<400> 2 

gatctgcggt ga 12 

<210> 3 

<211> 24 

<212> DNA 

<213> Mus musculus 

<220> 

<400> 3 

accgacgtcg actatccatg aaca 24 

<210> 4 

<211> 12 

<212> DNA 

<213> Mus musculus 

<220> 

<400> 4 

gatctgttca tg 12 

<210> 5 

<211> 24 

<212> DNA 

<213> Mus musculus 

<220> 

<400> 5 

aggcaactgt gctatccgag ggaa 24 

<210> 6 

<211> 12 

<212> DNA 

<213> Mus musculus 



<220> 
<400> 6 

gatcttccct eg 12 

<210> 7 

<211> 1119 

<212> DNA 

<213> Mus musculus 

<220> 

<400> 7 

taaacaggct ctcctctcac ttatcaactg ttgacacttg tgegatctet gatggctgtc bU 

ctgcagaaat ctatgagttt ttcccttatg gggactttgg ccgccagctg cctgcttctc 120 

attgecctgt gggcccagga ggcaaatgcg ctgcccgtca acacccggtg caagcttgag 180 

gtgtccaact tccagcagcc gtacatcgtc aaccgcacct ttatgctggc caaggaggee 240 

agecttgeag ataacaacac agaegtcegg etcategggg agaaactgtt ccgaggagtc 300 

agtgctaaag ateagtgeta cctgatgaag caggtgetea acttcaccct ggaagacgtt 360 

ctgctccccc agtcagacag gttccagccc tacatgeagg aggtggtacc tttcctgacc 420 

aaactcagca atcagctcag ctcctgtcac ateageggtg acgaccagaa catccagaag 4 80 

aatgtcagaa ggctgaagga gacagtgaaa aagcttggag agagtggaga gatcaaggcg 540 

attggggaac tggacctget gtttatgtct ctgagaaatg ettgegtctg agegagaaga 600 

agctagaaaa cgaagaactg ctccttcctg ccttctaaaa agaacaataa gatccctgaa 660 

tggacttttt cactaaagga aagtgagaag ctaacgtcca tcatcattag aagatttcac 720 

atgaaacctg gctcagttga aaaagaaaat agtgtcaagt tgtccatgag accagaggta 780 

gacttgataa ccacaaagat tcattgacaa tattttattg tcactgatga tacaacagaa 840 

aaataatgta ctttaaaaaa ttgtttgaaa ggaggttacc tctcattcct ttagaaaaaa 900 

agcttatgta acttcatttc catatccaat attttatata tgtaagttta tttattataa 960 

gtatacattt tatttatgtc agtttattaa tatggattta tttatagaaa cattatctgc 1020 

tattgatatt tagtataagg caaataatat ttatgacaat aactatggaa acaagatatc 1080 

ttaggcttta ataaacacat ggatatcata aaaaaaaaa 1119 

<210> 8 

<211> 7445 

<212> DNA 

<213> Mus musculus 

<220> 

<400> 8 ^ n 

gtctatcact tgcttaagat tcttctaatt tataaaaaaa actatttctt aaaatgaaaa 60 

gcaaacagag caegtattta tagcatggtg ttctgaccat gcaggtacag agtggaatgg 120 

taagaggege tattatcagc attaaccaac atgttaatgt tttcttctgg caagcaaact 180 

tgaaatctat gtcttaaaca atcttcaagc ctctaatata gtgetaaega ctggagtccg 240 

ctgctgtcca acagagctct tgagcacget ctcctctgtt tgcaatttta tgttctttga 300 

tcgactcccc aacctctcac cttcggctcc tgatggccac ctttcaactt tetgeattta 3 60 



tgaactccat gttttaatct ttttattaaa atattcacac aatcagtgtt tgtgcaagtc 420 
tgtttcaccc acatgtatgt ctgtgcacca agtgctgcct ggtgcttgtg ggggcaagga 480 
gcaggagagg gtgccctggc accggagtca cggatggttg tgagccacca tgaggatgct 540 
gggagttaga cccaggtcct ccagaagtgc agcaaatgct cttaaccaca cgcaggcatt 600 
tctctctcca gccccaacat gagtgctttt agattccacc tagaatagag atctgatggc 660 
ttcactcact gccacctccc ctttgcatct ttctgccaag gaacaccaaa aagcaagaat 720 
ccccacactg ctttcgctcc tcaagtctgc acctctcaac aggtcaagat tctccagtgt 780 
ccctctaaca ctttccccag tgtccctcta acactttctc cagtgtccct ctaacacttt 840 
ctccagtgtc cctctaacac ttttgatctc aattagctga ggggagaaag atctcacaca 900 
gtgattttca tgacttcgcg ttctagtcta gatgtaggca tttgcgtgtc agtctagggt 960 
aggcgtctgc tcccgctgct taggaaagac tttcctagtc tagttgtcag gtgctatctg 1020 
ggattcagtg tacatacaat gcaaaaaatc ccagtatttt gtaaattctc ttcttcaact 1080 
atccatctat atagtatgtt attgtaggct catttaaaaa taatattttg agacttatgc 1140 
ttgcacaagt aaaatgtcag agaattagca aatgtatagt attattttat tttaaaaaaa 1200 
tctatgctta aaatgtctat tagattgttc actaccgata tttccaaact taacttgacc 12 60 
ttggctatga tttcaacctt tgtatttgca tctaccataa cagtctctga accagaacat 1320 
tctgtggcaa tgggagctgt gaagaaagcc aacattctta ttaaaaaaaa aaaacagcta 1380 
gttatagttt aggattccat atactaaaaa aaatagagat ataattattt taaaaattga 1440 
aataatctcc aagttttcat tatggcttat ttcaaagcac agaatatagg acacgggtct 1500 
tttatttctg gtcacttcta aagagataag aatctatgaa gttggtggga aaatgagtcc 1560 
gtgaccaaaa cgctgactca atagctacgg gagatcaaag gctgctctac tcaatcagaa 1620 
tctactacgg caaagccatg gctttctttg aaaaccgtgt ttagaagatt tctgggattt 1680 
gtgtgcaaaa gcaccttgtt ggccctcacc gtgacgtttt agggaagact tcccatctct 1740 
caaggtggga aggcttggag gtggtgtctt gtggcctcct atggtggtta ggtacttctc 1800 
agaagacagg actggaaatt agataatgtc tgatgtcata tcattcacaa taccaaaaaa 1860 
accctggtgt cccgatggct ataaaagcag caacttctgc ctctcccatc acaagcagag 1920 
acacctaaac aggtaagcac tcagacctct acagacaatc atctgcttgg taccatgcta 1980 
cccgacgaac atgctcccct gatgtttttg ccttttgctc tctcactaac aggctctcct 2040 
ctcacttatc aactgttgac acttgtgcga tctctgatgg ctgtcctgca gaaatctatg 2100 
agtttttccc ttatggggac tttggccgcc agctgcctgc ttctcattgc cctgtgggcc 2160 
caggaggcaa atgcgctgcc cgtcaacacc cggtgcaagc ttgaggtgtc caacttccag 2220 
cagccgtaca tcgtcaaccg cacctttatg ctggccaagg aggtacagct gcatctcttt 2280 
ctctccatac cgccttgcca ttttctctga agcacttgca aactctttag gggcgcttta 2340 
tctccgcagg tctcactacc tatgttttct gtctctttag agactcttta aggactgggt 2400 



ctttttctat ttctatttca aggtctcagg accatttcct atcttggcct tcaggacaca 2460 
tatactgaat tttatctaca gaggcgcatt tagaaagcca cccacgactg caatactttc 2520 
catttctctg tgctctcttc tgaactcata ctctcttggc tactcctgag acccactgcg 2580 
gacatacatc tctacttaca ggcttttctt ccatctcctt gtcacccagg cacttagggt 2640 
tttctctctt tcaggccagc cttgcagata acaacacaga cgtccggctc atcggggaga 2700 
aactgttccg aggagtcagt gtaagtcctc actgtgatga gcagggctag ctgcgggagc 27 60 
tggtggaccc tctgggatag tctgacgtat gacccctgct gcttcttgtc tacctgcagg 2820 
ctaaagatca gtgctacctg atgaagcagg tgctcaactt caccctggaa gacgttctgc 2880 
tcccccagtc agacaggttc cagccctaca tgcaggaggt ggtacctttc ctgaccaaac 2940 
tcagcaatca gctcagctcc tgtgtaagtc tgactctggc tacctatgct cctctctctt 3000 
cctcttctat tccagtaaga acccgaggtc ctgccctctc tctcttcaca agagtgagga 30 60 
gggcctcagc accaccacca tcataggcca cttgaaatag gtcacaaagg ctttggcttc 3120 
aattgagtaa tactttgagt ttgtatgagt gaagctttat ttgttttatc catggaaaga 3180 
aatcaactca aattctgtag gatgagaaag atgttgggaa cgaaaaaagg cctagataga 324 0 
gaaacagatc tgctgagtat agtacttatg gggggagcag ggggcgatat ccactgagta 3300 
caagtacttg tggggagaga aatccactga gtacaagtac ttgttggcat ggagatccac 3360 
tgagtacaag tacttgtggg gggagggaat ggcacagagc aaaagttgaa gggaaggaag 3420 
atggagaggc ctcatggttg ggggtgtgaa aggtcactcc ttttccatgt gatggagagt 34 80 
taagaaaaac cagtgtgtga gtttgatgtc ttcagacacc cccaactatg aaacatatcc 3540 
acgaggagcg ggcagactgt gggagacctg gcatttaggg aaggcgcggc ttttcacacg 3600 
agaaacttta tgctcatctc ttgtgctaca ctcccacctt tgatgaggtt cagctcaggt 3660 
ttcgtttcta ccgttcttgc tactggtgga aacttcagta ggattcccca aagacgagga 3720 
cagctcttct gtaagggagg gacctggatt tcagtgtcct agagaacgaa atagctcaga 3780 
gaatctaggt caacgtgaaa tctaggtcac agcgggcaaa aatgactgaa cgcctctatt 384 0 
ccaggtgaac ggtcacgtgc ctcagatata ctgaggtatt gggctcccac cggataagat 3900 
tctgttagtg agtctgcttt tattttgcag cacatcagcg gtgacgacca gaacatccag 3960 
aagaatgtca gaaggctgaa ggagacagtg aaaaaggtac tattggcaag ccacaatact 4020 
aagccattca gtaggagacg tggggatttc tttctctgct tcccagtccc ttctactttg 4080 
taacatttta tttgacttgt ctactatctg gtccattact cgcttagctg cacctgtatc 4140 
tagctgggtc tatagatctt tcaatctgtg tctaaatttg taagtcacaa ttctggagct 4200 
agcagaaagc ttagctcagc cagtctcatg agcacttgct cggaggatgg cttgtgacag 4260 
agtcaatgct agaagacagc atccctgatt cccagctctg cacttgccta gtggccatgt 4320 
gtaattactt tggcttgatt aagtatttgg gaaagccagt tcccacggac ctacataatc 4380 
tgaagaacca tgcattgaaa actagaaagc tgggcacaaa cttactagag atgatttttg 44 40 



agctcattaa acggatgctc tgaaatgtgg caaaatcaac ccagaataac aacaaaagag 4500 
ctggatttgc aaataggaca agtatttaga atcactggta ttaatagcta tcatcttaat 4560 
taaaatatag ggcctatata tatatttaag attaaacaca agagtggata gcctcccaat 4 620 
ttacttggcc tggtttcaaa agagtaaaaa tatcagtcat ggattaatta tagtgtcatg 4680 
aaagtatgag atggaaaccc tttccttact ttttaccttc atttcttagt tttttttttc 4740 
ttcacaccct gatcaagcca ctagtaagca cctatctgct gtgagctatt atatgacttt 4800 
acagcaaaca acattgctgt gtggcctctt tggggaaggg aacaggatag caggaggctc 4860 
aggctagcaa gtctgacttg ccctaaagcc agaggcatgg ttgatagcag agaaagtgag 4 920 
gctcttcgca agtgggtgtg cttaagtaat cagaaacagg aaggctccgg ttgatggaat 4 980 
tatcagtaag atatctaccc ttatctcctt ctatcgaacc taaatcgtct ctttttcttg 5040 
tgtgtaggct gataaacaca cttgttttct tttgagtgtt catggctttg tagattttta 5100 
gtgctctgcc agttcttgtt agagggtttg ttaccttgac acctgggctt ggatgttagc 5160 
atgccaaagg cacacacttc tgaatgcctg tgtaaaaggt tattattcat ttactttgtc 5220 
tttggaaagg tgaagcgtgt gtgagaaaga actcacagga gatgtgttct ctgtaggaaa 5280 
actttttttt tccccttaaa tgcctataat ccactttcag tcaactttga cttttatacc 5340 
atgctgtcac atgaaagagt gtttaggccc gctctcatgg ctctgggaaa agcaccaata 5400 
ggggaaggaa tgttatgctg agaaatctga ccggcaggga aactggtcag agctcccccg 54 60 
aagaccacca caggtgttaa gtaggaacag tccagggtgg gctcatgtaa tagaatggaa 5520 
cagagcgagg gaagataagc tacaaagttt catagggtcc ggagtcttaa agatacaaaa 5580 
tagctgcttg ggcttcataa caaaggaagt ctgggaaggc agcaagtgag agggaaatgg 5640 
aaagggaaaa aacagaatgt agaggacttg aacagctaca aatcctctac cagacgattt 57 00 
ttcttggaac aatctagaag gtagtggatt aggtgattgc agggggactt gctttgccat 57 60 
ttgaatctgg gtttttgtct ctccattgag gttgaaagcg tcaccctttt taccctcgaa 5820 
tggaggagga aagaaggggt gttatgactc ctacctggag ttttactagt ttacgcaatg 5880 
gaacagacac tcgggacctc ctcttgacaa aaaaaatgga aacctgttgt ttgtcttgtt 5940 
tgttcttttg ttaagaaagc acaggcaaag cccgaccaca tgggttgaat gtgggtcttt 6000 
gagtcaaggc ttttgagttg agcactcatc aatagttgat catggtcagg tggagggcta 6060 
cctgtcaggc cgagccctgc tggcttcgca cttaacatct ccaggtctca gtatcacttc 6120 
ctgctactta gcacagttag gagttgagca aacctttttt tccaaccccc actaaaattt 6180 
aattgacaaa agactgtgta atttgtggga tacagtgtga taattgatct atgtgtgcat 6240 
tgtgcaaggt tcaataagat agattaatag gcccatcaac agctttatgg gtgtgaaatg 6300 
caagtaatat aggtagatgc ctgtggtgtc cttaggtcag aaaggcatga ttttaaggtc 6360 
ttgggcaaat catattatac tcatgctaaa aatacattat gttgattatt aatcttttag 6420 
agaaggctga tacttggttt tggtgctcag caagcaaatg tcaccagctc tttctaactg 6480 



gtaccacttt agaaaatgct acctgtgctc 
tggagagagt ggagagatca aggcgattgg 
aaatgcttgc gtctgagcga gaagaagcta 
taaaaagaac aataagatcc ctgaatggac 
gtccatcatc attagaagat ttcacatgaa 
caagttgtcc atgagaccag aggtagactt 
tattgtcact gatgatacaa cagaaaaata 
ttacctctca ttcctttaga aaaaaagctt 
atatatgtaa gtttatttat tataagtata 
atttatttat agaaacatta tctgctattg 
acaataacta tggaaacaag atatcttagg 
tctgtcttgt aatttttctc cctttaatat 
atcattctca tgatttcatg cttgacccat 
ctgtggtttt gtgtgtgttg tgtgtgtgtg 
atattaggtg ttcttctcta tcagtctttg 
aacctgtagc taggctggcc aacaagctct 
tgtat 

<210> 9 

<211> 1111 

<212> DNA 

<213> Mus mus cuius 

<220> 

<400> 9 

aacaggctct cctctcagtt atcaactttt 
gcagaaatct atgagttttt cccttatggg 
tgccctgtgg gcccaggagg caaatgcgct 
gtccaacttc cagcagccgt acatcgtcaa 
ccttgcagat aacaacacag acgtccggct 
tgctaaggat cagtgctacc tgatgaagca 
gctcccccag tcagacaggt tccggcccta 
actcagcaat cagctcagct cctgtcacat 
tgtcagaagg ctgaaggaga cagtgaaaaa 
cggggaactg gacctgctgt ttatgtctct 
ctagaaaacg aagaactgct ccttcctgcc 
gactttttta ctaaaggaaa gtgagaagct 
gaaacctggc tcagttgaaa gagaaaatag 



aaattggttt gtattcttat tttcatagct 6540 
ggaactggac ctgctgttta tgtctctgag 6600 
gaaaacgaag aactgctcct tcctgccttc 6660 
ttttttacta aaggaaagtg agaagctaac 6720 
acctggctca gttgaaaaag aaaatagtgt 6780 
gataaccaca aagattcatt gacaatattt 684 0 
atgtacttta aaaaattgtt tgaaaggagg 6900 
atgtaacttc atttccatat ccaatatttt 6960 
cattttattt atgtcagttt attaatatgg 7020 
atatttagta taaggcaaat aatatttatg 7080 
ctttaataaa cacatggata tcataaatct 7140 
caacaatacc atcatcatca tcattaccca 7200 
attatactgt taaagttggt tcctggaggc 7260 
tggggttatg catgtgaaag ccagagatgg 7320 
ccttattatt tgagacaggg tctgtcactg 7380 
attaattttt tttaagatta attaattatg 7 44 0 

7445 



gacacttgtg cgatcggtga tggctgtcct 60 

gactttggcc gccagctgcc tgcttctcat 120 

gcccatcaac acccggtgca agcttgaggt 180 

ccgcaccttt atgctggcca aggaggccag 240 

catcggggag aaactgttcc gaggagtcag 300 

ggtgctcaac ttcaccctgg aagacattct 360 

catgcaggag gtggtgcctt tcctgaccaa 420 

cagtggtgac gaccagaaca tccagaagaa 480 

gcttggagag agcggagaga tcaaagcgat 540 

gagaaatgct tgcgtctgag cgagaagaag 600 

ttctaaaaag aacaataaga tccctgaatg 660 

aacgtccacc atcattagaa gatttcacat 720 

tgtcaagttg tccatgagac cagaggtaga 7 80 



cttgataacc acaaagattc attgacaata 
agtatgtact ttaaaaaatt gtttgaaagg 
cctatgtaac ttcatttcca taaccaatac 
atacatttta tttatgtcag tttattaata 
ttgatatttg agtataaagc aaataatatt 
taggctttaa taaacacatg aatatcataa 



<210> 10 

<211> 21 

<212> DNA 

<213> Mus musculus 

<220> 

<400> 10 

ctgcctgctt ctcattgccc t 21 

<210> 11 

<211> 21 

<212> DNA 

<213> Mus musculus 

<220> 

<400> 11 

caagtctacc tctggtctca t 21 

<210> 12 

<211> 20 

<212> DNA 

<213> Mus musculus 

<220> 

<400> 12 

gacgcaagca tttctcagag 20 

<210> 13 

<211> 16 

<212> DNA 

<213> Homo sapiens 

<220> 

<400> 13 

atgtatttcc gcagaaa 16 

<210> 14 

<211> 17 

<212> DNA 

<213> Homo sapiens 

<220> 

<400> 14 

ccttttctgg gaaatac 17 

<210> 15 

<211> 22 

<212> DNA 

<213> Mus musculus 

<220> 

<400> 15 

aggtgctcaa cttcaccctg ga 22 



<210> 16 

<211> 22 

<212> DNA 

<213> Mus musculus 

<220> 



ttttattgtc attgataatg caacagaaaa 840 
aggttacctc tcattcctct agaagaaaag 900 
tttatatatg taagtttatt tattataagt 960 
tggatttatt tatagaaaaa ttatctgatg 1020 
tatgataata actatagaaa caagatatct 1080 
a mi 



<400> 16 

ccactctctc caagcttttt ca 22 



<210> 17 

<211> 21 

<212> DNA 

<213> Mus musculus 

<220> 

<400> 17 

caagtctacc tctggtctca t 21 

<210> 18 

<211> 418 

<212> DNA 

<213> Homo sapiens 

<220> 

<400> 18 

agaagtgctg ttccctcaat ctgataggtt ccagccttat atgcaggagg tggtgccctt 60 
cctggccagg ctcagcaaca ggctaagcac atgtcatatt gaaggtgatg acctgcatat 120 
ccagaggaat gtgcaaaagc tgaaggacac agtgaaaaag cttggagaga gtggagagat 180 
caaagcaatt ggagaactgg atttgctgtt tatgtctctg agaaatgcct gcatttgacc 240 
agagcaaagc tgaaaaatga ataactaacc ccctttccct gctagaaata acaattagat 300 
gccccaaagc gatttttttt aaccaaaagg aagatgggaa gccaaactcc atcatgatgg 360 
gtggattcca aatgaacccc tgcgttagtt acaaaggaaa ccaatgccac ttttgttt 418 



<210> 19 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<220> 

<400> 19 

tggccaggaa gggcaccacc t 21 

<210> 20 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<220> 

<400> 20 

cctatcagat tgagggaaca g 21 

<210> 21 

<211> 36 

<212> DNA 

<213> artificial sequence 
<220> 

<221> n is inosine in all cases 

<222> 24,25,29,30,34,35 

<400> 21 

ggccacgcgt cgactagtac gggnngggnn gggnng 

<210> 22 
<211> 20 
<212> DNA 

<213> artificial sequence 

<220> 

<400> 22 

ggccacgcgt cgactagtac 20 



<210> 23 



<211> 20 

<212> DNA 

<213> Homo sapiens 

<220> 

<400> 23 

ccttccccag tcaccagttg 20 

<210> 24 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<220> 

<400> 24 

taattgttat tcttagcagg 20 

<210> 25 

<211> 690 

<212> DNA 

<213> Homo sapiens 

<220> 

<400> 25 

tgcacaagca gaatcttcag aacaggttct 

aattgtctgc aatggccgcc ctgcagaaat 

ccaccagctg cctccttctc ttggccctct 

gctcccactg caggcttgac aagtccaact 

tcatgctggc taaggaggct agcttggctg 

agaaactgtt ccacggagtc agtatgagtg 

acttcaccct tgaagaagtg ctgttccctc 

aggtggtgcc cttcctggcc aggctcagca 

atgacctgca tatccagagg aatgtgcaaa 

agagtggaga gatcaaagca attggagaac 

cctgcatttg accagagcaa agctgaaaaa 

ataacaatta gatgccccaa agcgattttt 

<210> 26 

<211> 4797 

<212> DNA 

<213> Homo sapiens 

<220> 

<400> 26 

tgcacaagca gaatcttcag aacaggttct 
aattgtctgc aatggccgcc ctgcagaaat 
ccaccagctg cctccttctc ttggccctct 
gctcccactg caggcttgac aagtccaact 
tcatgctggc taaggaggta tacatctcaa 
atccaaatag ttcttaaact tttcttcaga 
ttatccctga gggtagataa attttctgtt 
ttttttttct tgaacttctt ccttccattt 



ccttccccag tcaccagttg ctcgagttag 60 

ctgtgagctc tttccttatg gggaccctgg 120 

tggtacaggg aggagcagct gcgcccatca 180 

tccagcagcc ctatatcacc aaccgcacct 240 

ataacaacac agacgttcgt ctcattgggg 300 

agcgctgcta tctgatgaag caggtgctga 360 

aatctgatag gttccagcct tatatgcagg 420 

acaggctaag cacatgtcat attgaaggtg 480 

agctgaagga cacagtgaaa aagcttggag 540 

tggatttgct gtttatgtct ctgagaaatg 600 

tgaataacta accccctttc cctgctagaa 660 

690 



ccttccccag tcaccagttg ctcgagttag 60 

ctgtgagctc tttccttatg gggaccctgg 120 

tggtacaggg aggagcagct gcgcccatca 180 

tccagcagcc ctatatcacc aaccgcacct 24 0 

tcctgctctt tctcgttgga tctacttgga 300 

gcatctctaa gagctttagg aacccactgt 360 

ttttcagaga ctctttggga atctggcttt 420 

tggcctttat gatacatatg atgaattttt 480 



cccaaagagc ggccattcag taatccatct 
cattgttcta aactcatgca cacatctgaa 
ggggagacgg gatggggcac atgtctatgt 
agacccttag tcttttcttc tcttccaggc 
tctcattggg gagaaactgt tccacggagt 
ccgtgtgccg tccatgggta cttggggtgg 
tgaccctttc tgtttccctt ccacctgcag 
gtgctgaact tcacccttga agaagtgctg 
atgcaggagg tggtgccctt cctggccagg 
tcagctctca gcctatgccc acctacccct 
cccaactctc tctccttccc cctaccccta 
tgttatcagg agtcatttgg gatcatagag 
ttgagtttat agtggtgaat ggggtctgga 
tcttcggaaa aaaggcaact caggttgcgt 
gctgtggaaa tggatccatt gagtctaagt 
attagaagag aaagtgggaa atgggaaggc 
gccctgttga tgtcatggga agccacaaaa 
catttgaaac tatgaaaaaa agtttgagtg 
actgaagagg gcttaatttt cacatgagat 
caattttctg gagatacgat tgaggtttta 
ctctttccac aaatgcaaac ctcagtagga 
agggaagtga ctggattctg gcgtccaagg 
tgttgaaatc taggtcattg tgggcaaaat 
actgtacctc catgggtgtg gaggttcata 
cttgttattg ttttatagca tattgaaggt 
aagctgaagg acacagtgaa aaaggtagga 
aggagagaca aatgttgttt ttctttcctt 
cttgattctc ctaccaccag ggcgattact 
tatctagatg tcagtttcca aatcttgcaa 
tagcttgtct agtcacataa cctcagattc 
gaatgcaggt ctcctgaatc ccaagccagc 
tggtaccatt aattcttagg gaaatttcag 
tacttgttta aaaacagaaa aatgcctatg 
cattaatgca ttgctttgaa acttggaaga 



gatgattttt ttttccttta tgcctctgtg 540 
ttctgctttt agtctttatg atgttgctct 600 
ataaattttt tttctatttg ctcaatgtcc 660 
tagcttggct gataacaaca cagacgttcg 720 
cagtgtaagc tacagttgtg acgaacaggg 780 
tggtgatgat ggtttaggtc ttatccctta 840 
atgagtgagc gctgctatct gatgaagcag 900 
ttccctcaat ctgataggtt ccagccttat 960 
ctcagcaaca ggctaagcac atgtgtaagt 1020 
ccttccctcc ttccacagag acccccttac 1080 
agctagcagg aagaagtgtc ttggcagcag 1140 
tatttgcttt tgctttgact gagtcacatc 1200 
acttaagtgt acagaagccg cattggtttg 12 60 
aagatgagaa aggtgttggg aaaacatcta 1320 
tgttgagggg aggggatggc atggagagaa 1380 
ttaaagtcgg tggtgggtcg gcagactgtt 1440 
tcggaggcgt gtgaacttga tgccgctgaa 1500 
gagtgggccc agtaaaaggc cctaggactt 1560 
gttttatgta catttcttgt tctaagcatg 1620 
ttccttacag aatttgcata aactactccg 1680 
tttcccaaag atgaagagag gtctcttgta 17 40 
gaattcaaga gctcaggaaa tctaggtcac 1800 
tactaagagc tttaattcca ggtgaattgt 1860 
aagtttcagc acaacattaa gatagttatg 1920 
gatgacctgc atatccagag gaatgtgcaa 1980 
ctgataactg tcaatgctaa gtcatgcaat 204 0 
tctttcttcc catcactttg tgatttttca 2100 
ttggtgtctg tgtatgtaga tatatctata 2160 
attgtagaat tctagaactg gttgggatct 2220 
tggggatggt cagtggcaga gatagggcta 2280 
acttttcccg gtggtgatac agattagttt 2340 
attcctattg actcatgtaa tctgaagaag 2400 
ggcaaattta tttgaagtca tttttgaagt 24 60 
ataaactcag aacaatgaga aaagagctgg 2520 



acttgcatat agggctaatt tctggagtaa 
tatcagatat tgattatagt ttaaaagcaa 
gttcaaatag agtaaaaata ttagtaagag 
ggtaagcttt tttttcttcc tctctcccat 
ctccctcaac aaatccctag ggagcattta 
tgaatgatac catcatgtgg cctatttggt 
acaatagtga ctcaccccaa aaccggagga 
gcaagcaggt acaactaaat actcagaaac 
aacaagctta accttaattc cccctttttc 
ctgagcatca tttaatgagt gtgactgttt 
ttaaattgtg aagcccagtt ctcttgttat 
tgttagcatg ccacagacaa ggcatgcttt 
atcttgcttg ttgtctttag aaaagtgaag 
tgtgattttc aagaccttta atccattttg 
gccatgtgga agagtgatta tgcttttttg 
gcaatgttgt tcagagaaag atcaacagga 
gtggttttgg gaggcattaa ttccctctcg 
gtaaaatgca tgacagacag taggggacga 
tctttgagat agaaaagaat atctttttgg 
aagggcggaa gaaagcagga aaaggaagaa 
aggtttttct tgaaataatg caaatatgat 
ttcacttgaa tttgggtttc ctcttcgatt 
ggagagagaa agaatgaatg ttaggaccta 
gaaaagactt atttggtatt tttcccacaa 
aaaaggtgga aatagaaaaa gccttaatgt 
agtagagatg ttttaaatca ggagtgtcca 
gaattgtctt ggtacacaca taaaatacaa 
tgcataaatc tcatactgtt ttaagaaagt 
ctgtcctggg ccatgtgcgg cctgtgggct 
tgtcatagat agttttggag ctgcaaaaca 
cccccagatc ccagcctcac ttcagtctcc 
tctgcccagc ttttaaacag cttcattagt 
gtggcctctc agtccagaga gccgtcattt 
agggatatta ctatgaatgt tttacaaatg 



taaacactta ttttgaatta tcataatatc 2580 
gagcagacaa ccccgatctc ttttatacag 2 64 0 
atttattata gttaaatgga agtctgaatt 2700 
caagaccttc cattctagtt tcttccttca 27 60 
tccatggtgg gctggtgtac atttctatag 2820 
gaaaagaaca acaatggaag gcttagacta 2880 
atgattagga gcagtgaaag tgacgctctt 294 0 
atgaaggctc cagttgatgg aattttcagt 3000 
cctcttgact ttttaaaaaa gcgtttcttc 3060 
cttcctttga taattgaagg ctttgtagtt 3120 
agaactatta tctagacatg gagggctgaa 3180 
acacatcttg cttaaaaaat tactgatttc 3240 
tgtgagagag gagaatctca tggtgatctg 3300 
aaagaatcaa tttcatattt gcaatgggtt 3360 
ctggtagctt cagaaagcac aggagggaga 3420 
ggagaaactg tcagagctgt ctgaaatagg 3480 
ttgggggtaa aagcagaacg caggttggta 3540 
taaactttaa aattctttat agtcttggag 3 600 
ccttatgtca aaagaagtat ggaaaggtga 3660 
ccatgtatta tatagaggac aatggtgaca 3720 
agattagagg aatttcagta gggaatgctt 37 80 
aagtttggga tcctcatctg catttgactt 3840 
tatctggttt tctattaact aaagcaagtg 3900 
aagtgaaaac ttttctttta ctgtttgtca 3960 
attggtgaat acatggttca aagtcatttg 4020 
atcatttggc ttccctggac caccttgaaa 4080 
gaacaatagc tgatgagcta aaaaagtcca 414 0 
ttatgaattt ctgttagggt gcattcaaag 4200 
gcaggttgga caagctcctt ataagtaatc 42 60 
ggccaaggca taatgggtgg cactcgggat 4320 
ttgctctggt taagaagggg tggtcaactc 4380 
gtgaggtgca cctgaaattg atgcctgctg 4440 
taagctcttt ggcaaatcat acaatactaa 4500 
cttaaaactc ggtttctgtc tccatcaacc 4560 



taatcttgca atttctaatt tgttcacttt 
ttgcattctt attttcacag cttggagaga 
atttgctgtt tatgtctctg agaaatgcct 
ataactaacc ccctttccct gctagaaata 



<210> 27 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<220> 

<400> 27 

atcagatgga ttactgaatg 20 

<210> 28 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<220> 

<400> 28 

agctcagcta cagcacagat 20 

<210> 29 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<220> 

<400> 29 

cctgccccat ttattggcag 20 

<210> 30 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<220> 

<400> 30 

tgtcctctgc caccctaaca 20 

<210> 31 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<220> 

<400> 31 

taattcacca ggaccatcat 20 

<210> 32 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<220> 

<400> 32 

gtggactcag gcaatgatgt 20 

<210> 33 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<220> 

<400> 33 

acatagagtgt taaagtggg 19 

<210> 34 
<211> 20 



agaaaacatg gcataaatgc tcaaatactt 4 620 
gtggagagat caaagcaatt ggagaactgg 4 680 
gcatttgacc agagcaaagc tgaaaaatga 4740 
acaattagat gccccaaagc gattttt 4797 



<212> DNA 

<213> Homo sapiens 

<220> 

<400> 34 

gctggaaggt ggacagcgag 20 

<210> 35 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<220> 

<400> 35 

tggcatcgtg atggactccg 20 

<210> 36 

<211> 20 

<212> DNA 

<213> Mus musculus 

<220> 

<400> 36 

tctgctccct gctcctggga 20 

<210> 37 

<211> 20 

<212> DNA 

<213> Mus musculus 

<220> 

<400> 37 

tccaggaggt ctgtagtaat 20 

<210> 38 

<211> 21 

<212> DNA 

<213> Mus musculus 

<220> 

<400> 38 

ctgcctgctt ctcattgccc t 21 

<210> 39 
<211> 21 
<212> DNA 

<213> Mus musculus 

<220> 

<400> 39 

caagtctacc tctggtctca t 21 



<210> 40 
<211> 179 
<212> PRT 

<213> Mus musculus 

<220> 

<400> 40 

Met Ala Val Leu Gin Lys Ser Met Ser Phe Ser Leu Met Gly Thr Leu 
15 io 15 

Ala Ala Ser Cys Leu Leu Leu He Ala Leu Trp Ala Gin Glu Ala Asn 
20 25 30 

Ala Leu Pro Val Asn Thr Arg Cys Lys Leu Glu Val Ser Asn Phe Gin 
35 40 45 

Gin Pro Tyr He Val Asn Arg Thr Phe Met Leu Ala Lys Glu Ala Ser 
50 55 60 

Leu Ala Asp Asn Asn Thr Asp Val Arg Leu He Gly Glu Lys Leu Phe 
65 70 75 80 



Arg Gly Val Ser Ala Lys Asp Gin Cys Tyr Leu Met Lys Gin Val Leu 
85 90 95 

Asn Phe Thr Leu Glu Asp Val Leu Leu Pro Gin Ser Asp Arg Phe Gin 
100 105 HO 

Pro Tyr Met Gin Glu Val Val Pro Phe Leu Thr Lys Leu Ser Asn Gin 
115 120 125 

Leu Ser Ser Cys His He Ser Gly Asp Asp Gin Asn He Gin Lys Asn 

130 135 140 

Val Arg Arg Leu Lys Glu Thr Val Lys Lys Leu Gly Glu Ser Gly Glu 
145 150 155 160 

He Lys Ala He Gly Glu Leu Asp Leu Leu Phe Met Ser Leu Arg Asn 
165 170 175 

Ala Cys Val 

<210> 41 
<211> 179 
<212> PRT 

<213> Mus musculus 

<220> 

<400> 41 

Met Ala Val Leu Gin Lys Ser Met Ser Phe Ser Leu Met Gly Thr Leu 
15 10 15 

Ala Ala Ser Cys Leu Leu Leu He Ala Leu Trp Ala Gin Glu Ala Asn 
20 25 30 

Ala Leu Pro He Asn Thr Arg Cys Lys Leu Glu Val Ser Asn Phe Gin 
35 40 45 

Gin Pro Tyr He Val Asn Arg Thr Phe Met Leu Ala Lys Glu Ala Ser 
50 55 60 

Leu Ala Asp Asn Asn Thr Asp Val Arg Leu He Gly Glu Lys Leu Phe 
65 70 75 80 

Arq Glv Val Ser Ala Lys Asp Gin Cys Tyr Leu Met Lys Gin Val Leu 
85 90 95 

Asn Phe Thr Leu Glu Asp He Leu Leu Pro Gin Ser Asp Arg Phe Arg 
100 105 HO 

Pro Tyr Met Gin Glu Val Val Pro Phe Leu Thr Lys Leu Ser Asn Gin 
115 120 125 

Leu Ser Ser Cys His He Ser Gly Asp Asp Gin Asn He Gin Lys Asn 
130 135 140 

Val Arg Arg Leu Lys Glu Thr Val Lys Lys Leu Gly Glu Ser Gly Glu 
145 150 155 160 

He Lys Ala He Gly Glu Leu Asp Leu Leu Phe Met Ser Leu Arg Asn 
165 170 175 

Ala Cys Val 

<210> 42 

<211> 5935 

<212> DNA 

<213> Mus musculus 

<220> 

<400> 42 



gaattcaagt 


ccacatgcaa 


tcaatccgaa 


tactttgtaa 


attctcttct 


tcaaatatcc 


60 


atctatatag 


tataagttat 


tgtaggatca 


tttaaaaata 


atgttttgag 


acttatgttt 


120 


gcacaagtaa 


aatgtcagag 


agaattagca 


aatgtatagt 


attattttat 


tttaaaaaat 


180 


ctatgcttaa 


aatgtctatt 


agattgttca 


ctactgacat 


ttccaaactt 


aacttgacct 


240 


tggctatgat 


ttcaaccttt 


gtatttgcat 


ctaccataac 


tgtgtgctca 


cttaccatgc 


300 


tatccgacga 


gcatgttccc 


ctgatgtttt 


tgccttttgc 


tctctcgcta 


acaggctctc 


360 


ctctcagtta 


tcaacttttg 


acacttgtgc 


gatcggtgat 


ggctgtcctg 


cagaaatcta 


420 


tgagtttttc 


ccttatgggg 


actttggccg 


ccagctgcct 


gcttctcatt 


gccctgtggg 


480 


cccaggaggc 


aaatgcgctg 


cccatcaaca 


cccggtgcaa 


gcttgaggtg 


tccaacttcc 


540 


agcagccgta 


catcgtcaac 


cgcaccttta 


tgctggccaa 


ggaggtacag 


ctgcatctct 


600 


ttctctccat 


accgccttgc 


catttctctg 


aagcacttgc 


aaactcttta 


ggggcgcttt 


660 


atctccgcag 


gtctcactac 


ctatgttttc 


tgtctcttta 


gagactcttt 


aaggactgga 


720 


tctttttcta tttctatttc aaggtctcag gaccatttcc tatcttggcc ttcaggacac 


780 


atatactgaa 


ttttatctac 


agaggcgcgt 


ttagaaagcc 


acccacgact 


gcaatacttt 


840 


ccatcctgtt 


gtgctctctt 


ctgaactcat 


actctcttgg 


ctactcctga 


gacccactgc 


900 


ggacatacat 


ctctacttac 


aggcttttct 


tccatctcct 


tgtcacccag 


gcacttaggg 


960 


ttttctctct 


ttcaggccag 


ccttgcagat 


aacaacacag 


acgtccggct 


catcggggag 


1020 


aaactgttcc 


gaggagtcag 


tgtaagtcct 


cactgtgatg 


agcagggcta 


gctgcgggag 


1080 


ctggtggacc 


ctctgggata 


gtctgacgta 


tgacccctgc 


tgcttcttgt 


ctacctgcag 


1140 


gctaaggatc 


agtgctacct 


gatgaagcag 


gtgctcaact 


tcaccctgga 


agacattctg 


1200 


ctcccccagt 


cagacaggtt 


ccggccctac 


atgcaggagg 


tggtgccttt 


cctgaccaaa 


1260 


ctcagcaatc 


agctcagctc 


ctgtgtaagt 


ctggctctgg 


ctacctatgc 


tcctctctct 


1320 


tcctcttcta 


ttccagtaag 


aacccgaggt 


cctgccctct 


ctctcttcac 


aagagtgagg 


1380 


agggcctcag 


caccaccacc 


atcataggcc 


acttgaaata 


ggtcacaaag 


gctttggctt 


1440 


caattgagta 


atactttgag 


tttgtattag 


ttaagcttta 


tttgttttat 


ccatggaaag 


1500 


aaatcaactc 


aaattctgta 


ggatgagaaa 


gatgttggga 


acgaaaaaag 


gcctagatag 


1560 


agaaacagat 


ctgctgagta 


cagtacttat 


gggggggggg 


ggcagggggc 


gatatccact 


1620 


gagtccaagt 


acttgttggg 


agagaaatcc 


actgagtaca 


agtacttgtg 


ggggaaggaa 


1680 



tggcacagag caaaagttga agggaaagag gaagatggag aggcctcaat gttgggggtg 1740 
tgaaaggtca ctcctttttc catgtgatgg agagttaaga aaaatcagtg tgtgagtttg 1800 
atgtcttcag acaccccaac tatggcagac tgtgggagac ctggcattta gggaaggcgc 18 60 
ggcttttcac acgagaaact ttatgctcat ctcttgtgct acactcccac ctttgatgag 1920 
gttaagctca ggtttcgttt ctaccgttct tgctactggt ggaaacttca gtaggattcc 1980 
ccaaagacga ggacagctct tctgtaaggg agggacctgg atttcagtgt cctagagaac 2040 



gaaatagctc agagaatcta ggtcaacgtg 
gaacgcctct attccaggtg aacggtcacg 
caccggataa gattctgtta gtgagtctgc 
ccagaacatc cagaagaatg tcagaaggct 
aagccacaat actaagccat tcagtaggag 
ctcttctact ttgtaacatt ttctttgact 
ctgcacctgc atctagctgg gtctatagat 
caattctgga gctagcagaa agcttagctc 
tggcttgtga cagagtcaat gctagaagac 
ctagtggcca cgtgtaatta ctttagcctg 
gacctacata atccgaagaa gcatgcattg 
gagatgattt ttgagctcat taaactgatg 
aacaacaaaa gagctggatt tgcaaatagg 
ctgtcatctt aattaaaata tagtgtctat 
gtggataact tcccaattta ctgggcctgg 
ttaattatag tgtcatgaaa gtatgagttg 
tcttagttat tatttttttt tcttcacacc 
ctgcgagcta ttatatgact ttacagcaaa 
ggaacaggat agcaggaggc tcaggctagc 
tggttgatag cagagaaagt gaggctcttc 
aggaaggctc tggttgatgg aattatcagt 
agaagctaaa ccgtctctcc ttcttgtgtg 
agtgttcatg gctttgcaga ttttcagtgc 
cttgacacct gggcttggat gttagcatgc 
aaaggttatt attcatttac tttgtctttg 
acaggagatg tattctctgt aggaaaactt 
ttcagtcaac tttgactttt ataccatgct 
cgtggctctg ggaaaagcac caatagggga 
agggaaactg ggtcagagct ccccaaagac 
ggtgggttca tataatagaa tggaacagag 
ggtcctaagt ctttaagata caaaatagct 
aaggcagcaa gcattgagag ggagatggaa 
gctacaaatc ctccacgaga ggatttttct 
aggtggatcg cagaaggact tgctttgcca 



aaatctaggt cacagcgggc aaaaatgact 2100 
tgcctcagat atactgaggt attgggctcc 2160 
ttttattttg cagcacatca gtggtgacga 2220 
gaaggagaca gtgaaaaagg tactattggc 2280 
acgtggggat ttctttctct gcttcccagt 2340 
tgtctactgt ctggtccatt actcacttag 2400 
ctttcaatct gtgtctaaat ttgtaagtca 2460 
agccagtctc atgagcactt gctcggagga 2520 
agcatccctg attcccagct ctgcacttgc 2580 
attaagtatt tgggaaagcc aattcccacc 2 64 0 
aaaactagaa agctgggcac aaacttacta 2700 
ctctgaaatg tgatcaaatc aacccagaat 27 60 
acaagtattt agaatcactg gtattaacag 2820 
ttagctgcct atttaagatt aaacacaaga 2880 
tttcaataga gtaaaaatat cagtcataga 2940 
gaaacccttt ccttactttt taccttcatt 3000 
ctgatcaagc cactagtaag cacctatctg 3060 
caacattgct gtgtggcctc tttggggaag 3120 
aagtctggac tcaacctaaa gccagaggca 3180 
acaagtgggt gtgcttaagt aatcagaaac 3240 
aagatatcta cccttatctc cttcttctat 3300 
taggctgata aacacgcttg ttttcttttg 3360 
tctgccagtt cttgttagag ggtttgttac 3420 
caaaggcaca cacttctgaa tgcctgtgta 3480 
gaaaggtgaa gtgtgtgtga gaaagaactc 3540 
ttttttcccc ttaaaagcct ataatccact 3600 
gtcacatgaa agagtgttta ggcccgctct 3660 
agaaatgtta tgccgagaaa tctgactggc 3720 
cactacaggt gttaagtagg aacagtcgag 3780 
ggagggaaga taagctacaa agtttcatag 3840 
ggttgggctt cataacaaag gaagtctggg 3 900 
agggaaaaaa caatgtagag gatttgaaaa 3960 
tggaggaatc tagaacaagg gtggtggatt 4020 
tttgaatctg ggtttttgtc tctccattga 4080 



ggttgagagc gtcacccttt tttaccctgg 
tcctacctgg agttttacta gtttacgcaa 
aagaaaaaaa aaaaaaaaag gaaacctgtt 
gcacaggcag ctgggcatgg tggcccatgc 
aggtgacttt ctaaattcaa ggccagcctg 
gctatacaga gaaaccctgt ctcgggaaaa 
agaagaggag aggagaggag aggagaggag 
aggagaggag aagagaagag aagagaagag 
aagagaagag aagagaagag aagagaagag 
agaaaagaaa aaagcaagca agcaagcact 
ggtctttgag acaaggcttt tgaattgagc 
agggctacct gtcaggccga gccctgctgg 
tcacttcctg ctgcttagca cagttaggag 
aaaatttaat ttacaaaagg cagtgtaatt 
tgtgcattgt gcaaggttca ataaggtaga 
tgaaatgcaa gtaatatagg tagatgcctg 
aaggtcttgg gcaaatcata ttatactcat 
ttttagagaa ggctgatact tggttttggt 
taactagtac cactttagaa aatgctaccc 
atagcttgga gagagcggag agatcaaagc 
tctgagaaat gcttgcgtct gagcgagaag 
gccttctaaa aagaacaata agatccctga 
gctaacgtcc accatcatta gaagatttca 
tagtgtcaag ttgtccatga gaccagaggt 
atattttatt gtcattgata atgcaacaga 
aggaggttac ctctcattcc tctagaagaa 
tactttatat atgtaagttt atttattata 
atatggattt atttatagaa aaattatctg 
atttatgata ataactatag aaacaagata 
taaatcttct gtcttgtaat ttttctccct 
ttacccaatc attctcatga cttcatgctt 

<210> 43 
<211> 179 
<212> PRT 

<213> Homo sapiens 



ataggaggag gaaagaaggg gtgttttgac 4140 
tggaacagac actcgggacc tcctcttgac 4200 
gtttctcttg tttgttcttt tgttaagaaa 4260 
ctttaatccc agcatttggg aggcagaggc 4320 
gtctacaaag tgagttccag gacagccagg 4380 
aaaaaaaaga agaaaagaaa agaaaagaag 4440 
aggagaggag aggagaggag aggagaggag 4500 
aagagaagag aagagaagag aagagaagag 4560 
aagagaagag aagagaaaag aaaagagaaa 4 620 
ggcaaagcat gcccacatgg gacgtatgtg 4 680 
gctcatcaat agttgatcat ggtcaggtgg 4740 
cttagcactt aacatctcca ggtctcagta 4800 
ttgagcaaac ctttttttcc aacccccact 4860 
tgtgggatac agtgtgataa ttgatctatg 4 920 
tcaataggcc catcaacagc tttatgggtg 4980 
tgtgtcctta ggtcagaaag gcatgatttt 5040 
gttaaaaatg cattatgttg attatcaatc 5100 
gctcagcaag caaatgtcac cagctctttc 5160 
gtgctcaaat tggtttgtat tcttattttc 5220 
gatcggggaa ctggacctgc tgtttatgtc 5280 
aagctagaaa acgaagaact gctccttcct 5340 
atggactttt ttactaaagg aaagtgagaa 5400 
catgaaacct ggctcagttg aaagagaaaa 5460 
agacttgata accacaaaga ttcattgaca 5520 
aaaagtatgt actttaaaaa attgtttgaa 5580 
aagcctatgt aacttcattt ccataaccaa 5640 
agtatacatt ttatttatgt cagtttatta 5700 
atgttgatat ttgagtataa agcaaataat 57 60 
tcttaggctt taataaacac atgaatatca 5820 
ttaatatcaa caataccatc atcgtcatca 5880 
gactcatatt atctggtaaa gtttg 5935 



<220> 
<400> 43 

Met Ala Ala Leu Gin Lys Ser Val Ser Ser Phe Leu Met Gly Thr Leu 
15 10 15 

Ala Thr Ser Cys Leu Leu Leu Leu Ala Leu Leu Val Gin Gly Gly Ala 
20 25 30 

Ala Ala Pro lie Ser Ser His Cys Arg Leu Asp Lys Ser Asn Phe Gin 
35 40 45 

Gin Pro Tyr lie Thr Asn Arg Thr Phe Met Leu Ala Lys Glu Ala Ser 
50 55 60 

Leu Ala Asp Asn Asn Thr Asp Val Arg Leu lie Gly Glu Lys Leu Phe 
65 70 75 80 

His Gly Val Ser Met Ser Glu Arg Cys Tyr Leu Met Lys Gin Val Leu 
85 90 95 

Asn Phe Thr Leu Glu Glu Val Leu Phe Pro Gin Ser Asp Arg Phe Gin 
100 105 110 

Pro Tyr Met Gin Glu Val Val Pro Phe Leu Ala Arg Leu Ser Asn Arg 
115 120 125 

Leu Ser Thr Cys His lie Glu Gly Asp Asp Leu His lie Gin Arg Asn 
130 135 140 

Val Gin Lys Leu Lys Asp Thr Val Lys Lys Leu Gly Glu Ser Gly Glu 
145 150 155 160 

lie Lys Ala He Gly Glu Leu Asp Leu Leu Phe Met Ser Leu Arg Asn 
165 170 175 



Ala Cys He 



